LongCat-Flash-Thinking vs Qwen3-235B-A22B-Thinking-2507: Specs & Benchmark Comparison

Characteristic: LongCat-Flash-Thinking | Qwen3-235B-A22B-Thinking-2507
Company: Meituan | Alibaba
Release Date: September 21, 2025 | July 24, 2025
Parameters: 560B | 235B
Multimodal: No | No
Context (input): 256K
Context (output): 131K
Input Price / 1M: $0.30
Output Price / 1M: $3.00
Average Score: 0.9 | 0.9
Benchmarks
MMLU-Redux: 0.9 | 0.9
AIME 2025: 0.9 | 0.9
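
The per-million-token prices above translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the table lists a single value for each price row without saying which model it applies to, so the constants are just the listed figures, and the function name and the example token counts are hypothetical.

```python
from decimal import Decimal

# Rates copied from the price rows above (USD per 1M tokens). The table gives
# one value per row, so which model they belong to is left open here.
INPUT_PRICE_PER_M = Decimal("0.30")
OUTPUT_PRICE_PER_M = Decimal("3.00")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request at the listed per-1M-token rates."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


# Hypothetical workload: 20,000 prompt tokens and 5,000 generated tokens.
print(request_cost(20_000, 5_000))
```

Because output tokens are priced ten times higher than input tokens at these rates, long reasoning traces from a thinking model are likely to dominate the bill.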

Visual Benchmark Comparison

LongCat-Flash-Thinking vs Qwen3-235B-A22B-Thinking-2507
MMLU-Redux: 0.9 vs 0.9
AIME 2025: 0.9 vs 0.9

Verdict

LongCat-Flash-Thinking leads in 1 of the 2 comparison categories: it is the more recent release, while the average scores are tied at the displayed precision.

Overall Performance

Both models show comparable average scores: 0.9 for LongCat-Flash-Thinking and 0.9 for Qwen3-235B-A22B-Thinking-2507.

Recency

LongCat-Flash-Thinking is newer: it was released on September 21, 2025, about two months after Qwen3-235B-A22B-Thinking-2507 (July 24, 2025).
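
The verdict's "1 out of 2" count can be reproduced with a small tally. This is only one plausible reading of how the categories are scored; the page does not state its rule. The helper names `leader` and `categories_led` are hypothetical, and the values are the rounded scores and release dates shown in the comparison table.

```python
from datetime import date
from typing import Optional

# Values copied from the comparison table (average scores as displayed).
MODELS = {
    "LongCat-Flash-Thinking": {"average": 0.9, "released": date(2025, 9, 21)},
    "Qwen3-235B-A22B-Thinking-2507": {"average": 0.9, "released": date(2025, 7, 24)},
}


def leader(metric: str) -> Optional[str]:
    """Return the model with the larger value for `metric`, or None on a tie."""
    (name_a, a), (name_b, b) = MODELS.items()
    if a[metric] == b[metric]:
        return None
    return name_a if a[metric] > b[metric] else name_b


def categories_led(model: str, metrics=("average", "released")) -> int:
    """Count the categories in which `model` is strictly ahead."""
    return sum(1 for metric in metrics if leader(metric) == model)


# Days between the two release dates.
gap_days = (
    MODELS["LongCat-Flash-Thinking"]["released"]
    - MODELS["Qwen3-235B-A22B-Thinking-2507"]["released"]
).days

print(categories_led("LongCat-Flash-Thinking"), gap_days)
```

Under this reading a tie counts for neither model, which is why equal average scores leave recency as the only category with a leader.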

More About These Models

The LongCat-Flash-Thinking and Qwen3-235B-A22B-Thinking-2507 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the LongCat-Flash-Thinking or Qwen3-235B-A22B-Thinking-2507 page. See also the complete list of AI model comparisons.