Pinned W1→W2 comparison
Zod (TypeScript) · 28-task shared slice
“Zod (TypeScript) is tightly contested: GPT-5.4 edges out GPT-5.3 Codex at 79% vs 75% — but quality separates them: GPT-5.4 leads equivalence at 45%; GPT-5.4 costs 12x less.”
Updated Apr 17, 2026·463 runs·3 repos·Judge: GPT-5.3 Codex
28 tasksWeek 3