STET
■
Leaderboard
Run experiment
Opus 4.7 vs Old Opus 4.6 vs New Opus 4.6
April 17, 2026
Your AGENTS.md is the highest-leverage code you're not testing
April 8, 2026
Your AI coding benchmark is hiding a 2x quality gap
March 14, 2026