GLM-4.7-Flash vs Kimi K2-Thinking-0905: Specs & Benchmark Comparison

CharacteristicGLM-4.7-FlashKimi K2-Thinking-0905
CompanyZhipu AIMoonshot AI
Release DateJanuary 18, 2026September 4, 2025
Parameters30B1.0T
MultimodalNoNo
Context (input)128K262K
Context (output)16K66K
Input Price / 1M$0.07$0.60
Output Price / 1M$0.40$2.40
Average Score0.60.9
Benchmarks
GPQA0.80.8
AIME 20250.91.0

Visual Benchmark Comparison

GLM-4.7-Flash
Kimi K2-Thinking-0905
GPQA0.8 vs 0.8
0.8
0.8
AIME 20250.9 vs 1.0
0.9
1.0

Verdict

GLM-4.7-Flash leads in 2 out of 4 comparison categories.

Overall Performance

Both models show comparable average scores: GLM-4.7-Flash — 0.6, Kimi K2-Thinking-0905 — 0.9.

API Cost

GLM-4.7-Flash is 6.4x cheaper: input $0.07/1M vs $0.60/1M tokens.

Context Window

Kimi K2-Thinking-0905 supports a larger context: 262K vs 128K tokens.

Recency

GLM-4.7-Flash is newer: released 1/18/2026 vs 9/4/2025.

More About These Models

Related Comparisons

The GLM-4.7-Flash and Kimi K2-Thinking-0905 comparison is updated for 2026. Data includes benchmark results, API pricing, context window size and other specifications. For more detailed information, visit the GLM-4.7-Flash or Kimi K2-Thinking-0905 page. See also the complete list of AI model comparisons.