Model Horizon
© 2026 Model Horizon

Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Specifications

Spec                            Gemini 2.5 Flash (Google)                      Grok 4.1 (xAI)
Tier                            mid                                            frontier
Release Date                    2025-06-17                                     2025-11-17
Context Window (tokens)         1049K                                          2000K
Max Output (tokens)             66K                                            16K
Input Price (per 1M tokens)     $0.30                                          $0.20
Output Price (per 1M tokens)    $2.50                                          $0.50
Arena Elo                       1,410                                          1,465
MMLU                            83.5%                                          90.2%
GPQA                            65.8%                                          88%
MATH                            82.1%                                          N/A
HumanEval                       90.3%                                          92%
SWE-bench                       N/A                                            65%
Capabilities                    vision, tool-use, code, reasoning, audio       vision, tool-use, code, reasoning

Benchmark Comparison

[Chart: Gemini 2.5 Flash vs. Grok 4.1 on Arena Elo, MMLU, GPQA, MATH, HumanEval, and SWE-bench. Scores are the same as those listed under Specifications; MATH is N/A for Grok 4.1 and SWE-bench is N/A for Gemini 2.5 Flash.]
Cost Estimate

Cost Calculator

Gemini 2.5 Flash: $4.20/mo ($0.1400/day)
Grok 4.1: $1.05/mo ($0.0350/day)
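
The page does not state the workload behind these estimates. As a rough sketch, the figures are consistent with about 50,000 input tokens and 50,000 output tokens per day, billed at the listed per-1M-token prices over a 30-day month. The workload, the 30-day month, and the function names below are assumptions inferred from the numbers, not settings documented by the calculator.

```python
# Assumed workload (not stated on the page): 50K input + 50K output tokens per day.
ASSUMED_INPUT_TOKENS_PER_DAY = 50_000
ASSUMED_OUTPUT_TOKENS_PER_DAY = 50_000
ASSUMED_DAYS_PER_MONTH = 30

# Prices in USD per 1M tokens, taken from the comparison above.
PRICES = {
    "Gemini 2.5 Flash": {"input": 0.30, "output": 2.50},
    "Grok 4.1": {"input": 0.20, "output": 0.50},
}


def daily_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the USD cost of one day's tokens at per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def monthly_cost(daily, days=ASSUMED_DAYS_PER_MONTH):
    """Scale a daily cost to a month of the assumed length."""
    return daily * days


def estimate(model):
    """Return (daily, monthly) cost for a model under the assumed workload."""
    price = PRICES[model]
    daily = daily_cost(
        ASSUMED_INPUT_TOKENS_PER_DAY,
        ASSUMED_OUTPUT_TOKENS_PER_DAY,
        price["input"],
        price["output"],
    )
    return daily, monthly_cost(daily)


if __name__ == "__main__":
    for name in PRICES:
        daily, monthly = estimate(name)
        print(f"{name}: ${monthly:.2f}/mo (${daily:.4f}/day)")
```

Under these assumptions the sketch reproduces the per-day and per-month figures shown above. A different token mix would change the gap between the two models, since their output prices differ by much more than their input prices.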