GPT-5.1 Medium vs Mercury 2: Specs & Benchmark Comparison

Characteristic	GPT-5.1 Medium	Mercury 2
Company	OpenAI	Inception
Release Date	November 11, 2025	February 1, 2026
Parameters	—	—
Multimodal	Yes	No
Context (input)	256K	—
Context (output)	64K	—
Input Price / 1M	$1.00	—
Output Price / 1M	$4.00	—
Average Score	1.0	0.7
Benchmarks
AIME 2025	1.0	0.9

Visual Benchmark Comparison

AIME 2025: GPT-5.1 Medium 1.0 vs Mercury 2 0.9

Verdict

Mercury 2 leads in 1 out of 2 comparison categories.

Overall Performance

GPT-5.1 Medium has the higher average score: 1.0 vs 0.7 for Mercury 2.

Recency

Mercury 2 is newer: released February 1, 2026, vs November 11, 2025 for GPT-5.1 Medium.

Frequently Asked Questions

Which is better for coding — GPT-5.1 Medium or Mercury 2?

A direct comparison on the SWE-Bench benchmark is not available. The only benchmark listed for both models is AIME 2025, a math benchmark, where GPT-5.1 Medium scores 1.0 and Mercury 2 scores 0.9.

Which model is cheaper — GPT-5.1 Medium or Mercury 2?

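GPT-5.1 Medium is listed at $1.00 per 1M input tokens and $4.00 per 1M output tokens. No pricing is listed for Mercury 2, so a direct price comparison is not possible from this data.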
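
As a rough illustration of what per-1M-token pricing means for a single request, the sketch below uses the GPT-5.1 Medium prices from the table above ($1.00 input, $4.00 output per 1M tokens). The function name `estimate_cost` and the token counts are hypothetical and chosen only for this example. Actual billing may differ, for example through cached-input discounts or other tiers not covered here.

```python
from decimal import Decimal

# Prices per 1M tokens, taken from the comparison table for GPT-5.1 Medium.
INPUT_PRICE_PER_M = Decimal("1.00")
OUTPUT_PRICE_PER_M = Decimal("4.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Illustrative request: 20,000 input tokens and 2,000 output tokens.
cost = estimate_cost(20_000, 2_000)
print(f"${cost:.4f}")  # prints $0.0280
```

Output tokens cost four times as much as input tokens at these rates, so a long response can outweigh a long prompt in the total.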

Which has a larger context window — GPT-5.1 Medium or Mercury 2?

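GPT-5.1 Medium is listed with a 256K-token input context and a 64K-token output limit. No context window data is listed for Mercury 2, so the two cannot be compared on this point.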

The GPT-5.1 Medium and Mercury 2 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the GPT-5.1 Medium or Mercury 2 page. See also the complete list of AI model comparisons.