Model Horizon
DashboardModelsCompareBenchmarks
© 2026 Model Horizon
About|Terms
SYS.v0.1.0
Skip to content
  1. Home
  2. /Compare

Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Swap
Specifications
SpecGemini 3.1 Pro
Google
Grok 4.1
xAI
Tierfrontierfrontier
Release Date2026-02-192025-11-17
Context Window
1049K
2000K
Max Output
66K
16K
Input Price (/1M)
$2
$0.2
Output Price (/1M)
$12
$0.5
Arena Elo
1,500
1,465
MMLU
92.6%
90.2%
GPQA
94.3%
88%
MATH
96.8%
—
HumanEval
94.6%
92%
SWE-bench
80.6%
65%
AIME
91.2%
78%
SimpleQA
79.6%
38%
Capabilitiesvision, tool-use, code, reasoning, agentic, audiovision, tool-use, code, reasoning

Benchmark Comparison

Gemini 3.1 Pro
Grok 4.1
Arena Elo
1,500
1,465
MMLU
92.6%
90.2%
GPQA
94.3%
88%
MATH
96.8%
N/A
HumanEval
94.6%
92%
SWE-bench
80.6%
65%
AIME
91.2%
78%
SimpleQA
79.6%
38%
Cost Estimate

Cost Calculator

Gemini 3.1 Pro
$21.00/mo$0.7000/day
Grok 4.1
$1.05/mo$0.0350/day
Try These Models
Try Gemini 3.1 ProTry Grok 4.1