DeepSeek-V3.1
DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it has 671B total parameters with 37B activated. Key improvements include smart tool calling, high thinking efficiency comparable to DeepSeek-R1-0528, and FP8 format.
Key Specifications
Timeline
Technical Specifications
Pricing & Availability
Benchmark Results
Model performance metrics across various tests and benchmarks
Reasoning
Other Tests
License & Metadata
Similar Models
All ModelsDeepSeek-V3 0324
DeepSeek
DeepSeek R1 Zero
DeepSeek
DeepSeek-V3
DeepSeek
DeepSeek-V3.2 (Thinking)
DeepSeek
DeepSeek-V3.2-Exp
DeepSeek
DeepSeek-R1
DeepSeek
DeepSeek-V3.2-Speciale
DeepSeek
DeepSeek-R1-0528
DeepSeek
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.