Model Horizon
DashboardModelsCompareBenchmarks
© 2026 Model Horizon
About|Terms
SYS.v0.1.10
Skip to content
  1. Home
  2. /Compare

Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Swap
Specifications
SpecGrok 4
xAI
Gemini 3.1 Pro
Google
Tierfrontierfrontier
Release Date2025-07-092026-02-19
Context Window
256K
1049K
Max Output
8K
66K
Input Price (/1M)
$3
$2
Output Price (/1M)
$15
$12
Arena Elo
1,410
1,500
MMLU
87.5%
92.6%
GPQA
87.5%
94.3%
MATH
91.7%
96.8%
HumanEval
90%
94.6%
SWE-bench
48%
80.6%
AIME
72%
91.2%
SimpleQA
34.2%
79.6%
Capabilitiesvision, tool-use, code, reasoningvision, tool-use, code, reasoning, agentic, audio

Benchmark Comparison

Grok 4
Gemini 3.1 Pro
Arena Elo
1,410
1,500
MMLU
87.5%
92.6%
GPQA
87.5%
94.3%
MATH
91.7%
96.8%
HumanEval
90%
94.6%
SWE-bench
48%
80.6%
AIME
72%
91.2%
SimpleQA
34.2%
79.6%
Cost Estimate

Cost Calculator

Grok 4
$27.00/mo$0.9000/day
Gemini 3.1 Pro
$21.00/mo$0.7000/day
Try These Models
Try Grok 4Try Gemini 3.1 Pro