Compare Models

Side-by-side comparison of AI model specs, benchmarks, and pricing.

Model A

Swap

Model B

Specifications

Spec	Claude Opus 4.6 Anthropic	Gemini 2.5 Flash Google
Tier	frontier	mid
Release Date	2026-02-05	2025-06-17
Context Window	200K	1049K
Max Output	128K	66K
Input Price (/1M)	$5	$0.3
Output Price (/1M)	$25	$2.5
Arena Elo	1,496	1,410
MMLU	92.5%	83.5%
GPQA	91.3%	65.8%
MATH	97.6%	82.1%
HumanEval	97%	90.3%
SWE-bench	72.5%	—
AIME	83.3%	58%
SimpleQA	43.2%	28.3%
Capabilities	vision, tool-use, code, extended-thinking, agentic	vision, tool-use, code, reasoning, audio

Claude Opus 4.6

Gemini 2.5 Flash

Arena Elo

1,496

1,410

MMLU

92.5%

83.5%

GPQA

91.3%

65.8%

MATH

97.6%

82.1%

HumanEval

97%

90.3%

SWE-bench

72.5%

N/A

AIME

83.3%

58%

SimpleQA

43.2%

28.3%

Cost Estimate

Tokens per day (input + output, 50/50 split)

Claude Opus 4.6

$45.00/mo$1.5000/day

Gemini 2.5 Flash

$4.20/mo$0.1400/day

Try These Models