Model Horizon
DashboardModelsCompareBenchmarks
© 2026 Model Horizon
About|Terms
SYS.v0.1.10
Skip to content
  1. Home
  2. /Compare

Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Swap
Specifications
SpecClaude Opus 4.6
Anthropic
Gemini 2.5 Flash
Google
Tierfrontiermid
Release Date2026-02-052025-06-17
Context Window
200K
1049K
Max Output
128K
66K
Input Price (/1M)
$5
$0.3
Output Price (/1M)
$25
$2.5
Arena Elo
1,496
1,410
MMLU
92.5%
83.5%
GPQA
91.3%
65.8%
MATH
97.6%
82.1%
HumanEval
97%
90.3%
SWE-bench
72.5%
—
AIME
83.3%
58%
SimpleQA
43.2%
28.3%
Capabilitiesvision, tool-use, code, extended-thinking, agenticvision, tool-use, code, reasoning, audio

Benchmark Comparison

Claude Opus 4.6
Gemini 2.5 Flash
Arena Elo
1,496
1,410
MMLU
92.5%
83.5%
GPQA
91.3%
65.8%
MATH
97.6%
82.1%
HumanEval
97%
90.3%
SWE-bench
72.5%
N/A
AIME
83.3%
58%
SimpleQA
43.2%
28.3%
Cost Estimate

Cost Calculator

Claude Opus 4.6
$45.00/mo$1.5000/day
Gemini 2.5 Flash
$4.20/mo$0.1400/day
Try These Models
Try Claude Opus 4.6Try Gemini 2.5 Flash