Side-by-side comparison of AI model specs, benchmarks, and pricing.
| Spec | Claude Sonnet 4.5 Anthropic | Gemini 3.1 Pro Google |
|---|---|---|
| Tier | frontier | frontier |
| Release Date | 2025-09-29 | 2026-02-19 |
| Context Window | 200K | 1049K |
| Max Output | 64K | 66K |
| Input Price (/1M) | $3 | $2 |
| Output Price (/1M) | $15 | $12 |
| Arena Elo | 1,450 | 1,500 |
| MMLU | 88.7% | 92.6% |
| GPQA | 83.4% | 94.3% |
| MATH | 87% | 96.8% |
| HumanEval | 95% | 94.6% |
| SWE-bench | 55.8% | 80.6% |
| AIME | 58% | 91.2% |
| SimpleQA | 30.8% | 79.6% |
| Capabilities | vision, tool-use, code, extended-thinking | vision, tool-use, code, reasoning, agentic, audio |