GPT-5.1 Thinking vs Mercury 2: Specs & Benchmark Comparison

CharacteristicGPT-5.1 ThinkingMercury 2
CompanyOpenAIInception
Release DateNovember 11, 2025February 1, 2026
Parameters
MultimodalYesNo
Context (input)256K
Context (output)128K
Input Price / 1M$3.00
Output Price / 1M$12.00
Average Score0.90.7
Benchmarks
GPQA0.90.7
AIME 20250.90.9

Visual Benchmark Comparison

GPT-5.1 Thinking
Mercury 2
GPQA0.9 vs 0.7
0.9
0.7
AIME 20250.9 vs 0.9
0.9
0.9

Verdict

Mercury 2 leads in 1 out of 2 comparison categories.

Overall Performance

Both models show comparable average scores: GPT-5.1 Thinking — 0.9, Mercury 2 — 0.7.

Recency

Mercury 2 is newer: released 2/1/2026 vs 11/11/2025.

More About These Models

Related Comparisons

The GPT-5.1 Thinking and Mercury 2 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the GPT-5.1 Thinking or Mercury 2 page. See also the complete list of AI model comparisons.