GLM-4.7-Flash

Zhipu AI

GLM-4.7-Flash is a fast, cost-efficient variant of Zhipu AI's GLM-4.7 model. It delivers competitive reasoning and coding performance at significantly lower latency and cost, and is optimized for high-throughput applications.

Key Specifications

Parameters
30.0B
Context
128.0K
Release Date
January 18, 2026
Average Score
60.5%

Timeline

Key dates in the model's history
Announcement
January 18, 2026
Last Update
January 22, 2026

Technical Specifications

Parameters
30.0B
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
Multimodal, ZeroEval

Pricing & Availability

Input (per 1M tokens)
$0.07
Output (per 1M tokens)
$0.40
Max Input Tokens
128.0K
Max Output Tokens
16.4K
Supported Features
Function Calling, Structured Output, Code Execution, Web Search, Batch Inference, Fine-tuning
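
To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a single request from the listed rates. The helper name `estimate_cost` and the example token counts are illustrative assumptions, not part of any official SDK, and actual billing may differ by provider.

```python
# Prices per 1M tokens, in USD, copied from the listing above.
INPUT_PRICE_PER_M = 0.07
OUTPUT_PRICE_PER_M = 0.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 100,000-token prompt with a 4,000-token completion.
cost = estimate_cost(100_000, 4_000)
print(f"${cost:.4f}")  # $0.0086
```

At these rates the output side dominates quickly: one output token costs roughly as much as 5.7 input tokens.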

Benchmark Results

Model performance metrics across various tests and benchmarks

Programming

Programming skills tests
SWE-Bench Verified
Self-reported
59.0%

Reasoning

Logical reasoning and analysis
GPQA
Diamond subset
Self-reported
75.0%

Other Tests

Specialized benchmarks
AIME 2025
Self-reported
92.0%
TAU-Bench
Self-reported
80.0%
BrowseComp
Self-reported
43.0%
HLE
Self-reported
14.0%
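
The page does not state how its Average Score is derived. As a quick check, and assuming equal weighting, an unweighted mean of the six self-reported scores listed here reproduces the 60.5% figure shown under Key Specifications:

```python
# Self-reported benchmark scores (percent) as listed on this page.
scores = {
    "SWE-Bench Verified": 59.0,
    "GPQA (Diamond)": 75.0,
    "AIME 2025": 92.0,
    "TAU-Bench": 80.0,
    "BrowseComp": 43.0,
    "HLE": 14.0,
}

# Unweighted mean; the equal weighting is an assumption, not documented.
average = sum(scores.values()) / len(scores)
print(f"{average:.1f}%")  # 60.5%
```

Because the mean weights all tests equally, the low HLE and BrowseComp results pull the average well below the model's scores on AIME 2025 and TAU-Bench.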

License & Metadata

License
MIT
Announcement Date
January 18, 2026
Last Updated
January 22, 2026
