Side-by-side comparison of AI model specs, benchmarks, and pricing.
| Spec | Claude Sonnet 4.6 (Anthropic) | GPT-5.3 Codex (OpenAI) |
|---|---|---|
| Tier | frontier | frontier |
| Release Date | 2026-02-17 | 2026-02-05 |
| Context Window (tokens) | 200K | 400K |
| Max Output (tokens) | 64K | 128K |
| Input Price (USD per 1M tokens) | $3 | $1.75 |
| Output Price (USD per 1M tokens) | $15 | $14 |
| Arena Elo | 1,470 | 1,480 |
| MMLU | 91% | 93% |
| GPQA | 88% | 81% |
| MATH | 97.8% | 96% |
| HumanEval | 96% | 93% |
| SWE-bench | 70.3% | 80% |
| AIME | 78% | 94% |
| SimpleQA | 39.5% | 58% |
| Capabilities | vision, tool-use, code, extended-thinking | vision, tool-use, code, reasoning, agentic |
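
The per-million-token prices above can be turned into a per-request cost estimate. The sketch below is illustrative only: the `Pricing` class, the `PRICES` dictionary, and the `request_cost` function are hypothetical names introduced here, and it assumes flat per-token billing with no caching, batching, or long-context surcharges, which real billing may include.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """List prices in USD per 1M tokens, copied from the table above."""
    input_per_million: float
    output_per_million: float


# Hypothetical lookup table; the keys are labels chosen for this example,
# not official API model identifiers.
PRICES = {
    "Claude Sonnet 4.6": Pricing(input_per_million=3.00, output_per_million=15.00),
    "GPT-5.3 Codex": Pricing(input_per_million=1.75, output_per_million=14.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming flat per-token pricing."""
    p = PRICES[model]
    total = (input_tokens * p.input_per_million
             + output_tokens * p.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 100,000 input tokens and 10,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 100_000, 10_000):.3f}")
```

Because the output price is several times the input price for both models, the comparison for a given workload depends on its ratio of input to output tokens.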