Model Horizon
DashboardModelsCompareBenchmarks
© 2026 Model Horizon
About|Terms
SYS.v0.1.0
Skip to content
  1. Home
  2. /Compare

Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Swap
Specifications
SpecGPT-5.3 Codex
OpenAI
Grok 4.1
xAI
Tierfrontierfrontier
Release Date2026-02-052025-11-17
Context Window
400K
2000K
Max Output
128K
16K
Input Price (/1M)
$1.75
$0.2
Output Price (/1M)
$14
$0.5
Arena Elo
1,480
1,465
MMLU
93%
90.2%
GPQA
81%
88%
MATH
96%
—
HumanEval
93%
92%
SWE-bench
80%
65%
AIME
94%
78%
SimpleQA
58%
38%
Capabilitiesvision, tool-use, code, reasoning, agenticvision, tool-use, code, reasoning

Benchmark Comparison

GPT-5.3 Codex
Grok 4.1
Arena Elo
1,480
1,465
MMLU
93%
90.2%
GPQA
81%
88%
MATH
96%
N/A
HumanEval
93%
92%
SWE-bench
80%
65%
AIME
94%
78%
SimpleQA
58%
38%
Cost Estimate

Cost Calculator

GPT-5.3 Codex
$23.63/mo$0.7875/day
Grok 4.1
$1.05/mo$0.0350/day
Try These Models
Try GPT-5.3 CodexTry Grok 4.1