Key Specifications
Parameters
560.0B
Context
-
Release Date
September 21, 2025
Average Score
85.1%
Timeline
Key dates in the model's history
Announcement
September 21, 2025
Last Update
February 17, 2026
Today
March 26, 2026
Technical Specifications
Parameters
560.0B
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
MultimodalZeroEval
Benchmark Results
Model performance metrics across various tests and benchmarks
Programming
Programming skills tests
SWE-Bench Verified
• Self-reported
Reasoning
Logical reasoning and analysis
GPQA
• Self-reported
Other Tests
Specialized benchmarks
MATH-500
Mean@1 • Self-reported
ZebraLogic
Mean@1 • Self-reported
AIME 2024
Mean@32 • Self-reported
AIME 2025
Mean@32 • Self-reported
MMLU-Redux
• Self-reported
Tau2 Telecom
Mean@4 • Self-reported
MMLU-Pro
• Self-reported
Tau2 Retail
Mean@4 • Self-reported
License & Metadata
License
mit
Announcement Date
September 21, 2025
Last Updated
February 17, 2026
Compare LongCat-Flash-Thinking
All comparisonsSimilar Models
All ModelsLongCat-Flash-Chat
Meituan
560.0B
Best score:0.9 (MMLU)
Released:Aug 2025
LongCat-Flash-Thinking-2601
Meituan
560.0B
Best score:1.0 (TAU)
Released:Jan 2026
LongCat-Flash-Lite
Meituan
68.5B
Best score:0.9 (MMLU)
Released:Feb 2026
GLM-4.5-Air
Zhipu AI
106.0B
Best score:0.8 (TAU)
Released:Jul 2025
Kimi K2-Thinking-0905
Moonshot AI
1.0T
Best score:0.8 (GPQA)
Released:Sep 2025
Price:$0.60/1M tokens
Nemotron 3 Super (120B A12B)
NVIDIA
120.0B
Best score:0.8 (GPQA)
Released:Mar 2026
Mistral Large 2
Mistral AI
123.0B
Best score:0.9 (HumanEval)
Released:Jul 2024
Price:$2.00/1M tokens
MiMo-V2-Flash
Xiaomi
309.0B
Best score:0.8 (GPQA)
Released:Dec 2025
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.