GPT-5.1 Thinking vs Qwen3-235B-A22B-Thinking-2507: Specs & Benchmark Comparison

CharacteristicGPT-5.1 ThinkingQwen3-235B-A22B-Thinking-2507
CompanyOpenAIAlibaba
Release DateNovember 11, 2025July 24, 2025
Parameters235B
MultimodalYesNo
Context (input)256K256K
Context (output)128K131K
Input Price / 1M$3.00$0.30
Output Price / 1M$12.00$3.00
Average Score0.90.9
Benchmarks
AIME 20250.90.9

Visual Benchmark Comparison

GPT-5.1 Thinking
Qwen3-235B-A22B-Thinking-2507
AIME 20250.9 vs 0.9
0.9
0.9

Verdict

Both models show equal results — the choice depends on your specific use case.

Overall Performance

Both models show comparable average scores: GPT-5.1 Thinking — 0.9, Qwen3-235B-A22B-Thinking-2507 — 0.9.

API Cost

Qwen3-235B-A22B-Thinking-2507 is 4.5x cheaper: input $0.30/1M vs $3.00/1M tokens.

Context Window

Same context size: 256K tokens.

Recency

GPT-5.1 Thinking is newer: released 11/11/2025 vs 7/24/2025.

More About These Models

Related Comparisons

The GPT-5.1 Thinking and Qwen3-235B-A22B-Thinking-2507 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the GPT-5.1 Thinking or Qwen3-235B-A22B-Thinking-2507 page. See also the complete list of AI model comparisons.