Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Model A

Swap

Model B

Specifications

Spec	Claude Sonnet 4.5 Anthropic	Gemini 3.1 Pro Google
Tier	frontier	frontier
Release Date	2025-09-29	2026-02-19
Context Window	200K	1049K
Max Output	64K	66K
Input Price (/1M)	$3	$2
Output Price (/1M)	$15	$12
Arena Elo	1,450	1,500
MMLU	88.7%	92.6%
GPQA	83.4%	94.3%
MATH	87%	96.8%
HumanEval	95%	94.6%
SWE-bench	55.8%	80.6%
AIME	58%	91.2%
SimpleQA	30.8%	79.6%
Capabilities	vision, tool-use, code, extended-thinking	vision, tool-use, code, reasoning, agentic, audio

Claude Sonnet 4.5

Gemini 3.1 Pro

Arena Elo

1,450

1,500

MMLU

88.7%

92.6%

GPQA

83.4%

94.3%

MATH

87%

96.8%

HumanEval

95%

94.6%

SWE-bench

55.8%

80.6%

AIME

58%

91.2%

SimpleQA

30.8%

79.6%

Cost Estimate

Tokens per day (input + output, 50/50 split)

Claude Sonnet 4.5

$27.00/mo$0.9000/day

Gemini 3.1 Pro

$21.00/mo$0.7000/day

Try These Models