Qwen3-Next-80B-A3B-Instruct
Qwen3-Next-80B-A3B-Instruct is the first model in the Qwen3-Next series with breakthrough architectural innovations. Uses hybrid attention (Gated DeltaNet + Gated Attention) for efficient ultra-long context modeling, MoE with high sparsity (512 experts, 10 active + 1 shared), and multi-token prediction. 80 billion parameters (3 billion active), trained on 15T tokens. Outperforms Qwen3-32B-Base at 10% of the training cost. Context support up to 256K (expandable to 1M with YaRN). Apache 2.0 license.
Key Specifications
Timeline
Technical Specifications
Pricing & Availability
Benchmark Results
Model performance metrics across various tests and benchmarks
Other Tests
License & Metadata
Compare Qwen3-Next-80B-A3B-Instruct
All comparisonsSimilar Models
All ModelsQwen2 72B Instruct
Alibaba
Qwen3 30B A3B
Alibaba
Qwen2.5 14B Instruct
Alibaba
QwQ-32B-Preview
Alibaba
Qwen3.5 27B
Alibaba
Qwen3.5 35B A3B
Alibaba
Qwen2.5 72B Instruct
Alibaba
Qwen3 32B
Alibaba
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.