GPT-5.1 Codex High vs Qwen3-235B-A22B-Thinking-2507: Specs & Benchmark Comparison

CharacteristicGPT-5.1 Codex HighQwen3-235B-A22B-Thinking-2507
CompanyOpenAIAlibaba
Release DateNovember 11, 2025July 24, 2025
Parameters235B
MultimodalYesNo
Context (input)400K256K
Context (output)128K131K
Input Price / 1M$1.25$0.30
Output Price / 1M$10.00$3.00
Average Score1.00.9
Benchmarks
AIME 20251.00.9

Visual Benchmark Comparison

GPT-5.1 Codex High
Qwen3-235B-A22B-Thinking-2507
AIME 20251.0 vs 0.9
1.0
0.9

Verdict

GPT-5.1 Codex High leads in 2 out of 4 comparison categories.

Overall Performance

Both models show comparable average scores: GPT-5.1 Codex High — 1.0, Qwen3-235B-A22B-Thinking-2507 — 0.9.

API Cost

Qwen3-235B-A22B-Thinking-2507 is 3.4x cheaper: input $0.30/1M vs $1.25/1M tokens.

Context Window

GPT-5.1 Codex High supports a larger context: 400K vs 256K tokens.

Recency

GPT-5.1 Codex High is newer: released 11/11/2025 vs 7/24/2025.

More About These Models

Related Comparisons

Frequently Asked Questions

Which is better for coding — GPT-5.1 Codex High or Qwen3-235B-A22B-Thinking-2507?
Direct comparison on the SWE-Bench benchmark is not available. We recommend reviewing other metrics on the comparison page.
Which model is cheaper — GPT-5.1 Codex High or Qwen3-235B-A22B-Thinking-2507?
Qwen3-235B-A22B-Thinking-2507 is cheaper for input: $0.30 per 1M tokens vs $1.25.
Which has a larger context window — GPT-5.1 Codex High or Qwen3-235B-A22B-Thinking-2507?
GPT-5.1 Codex High supports a larger context: 400,000 tokens vs 256,000.

The GPT-5.1 Codex High and Qwen3-235B-A22B-Thinking-2507 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the GPT-5.1 Codex High or Qwen3-235B-A22B-Thinking-2507 page. See also the complete list of AI model comparisons.