Alibaba logo

Qwen3-Next-80B-A3B-Instruct

Alibaba

Qwen3-Next-80B-A3B-Instruct is the first model in the Qwen3-Next series with breakthrough architectural innovations. Uses hybrid attention (Gated DeltaNet + Gated Attention) for efficient ultra-long context modeling, MoE with high sparsity (512 experts, 10 active + 1 shared), and multi-token prediction. 80 billion parameters (3 billion active), trained on 15T tokens. Outperforms Qwen3-32B-Base at 10% of the training cost. Context support up to 256K (expandable to 1M with YaRN). Apache 2.0 license.

Key Specifications

Parameters
80.0B
Context
65.5K
Release Date
September 9, 2025
Average Score
87.0%

Timeline

Key dates in the model's history
Announcement
September 9, 2025
Last Update
February 12, 2026
Today
March 25, 2026

Technical Specifications

Parameters
80.0B
Training Tokens
15.0T tokens
Knowledge Cutoff
-
Family
-
Capabilities
MultimodalZeroEval

Pricing & Availability

Input (per 1M tokens)
$0.15
Output (per 1M tokens)
$1.50
Max Input Tokens
65.5K
Max Output Tokens
65.5K
Supported Features
Function CallingStructured OutputCode ExecutionWeb SearchBatch InferenceFine-tuning

Benchmark Results

Model performance metrics across various tests and benchmarks

Other Tests

Specialized benchmarks
MMLU-Redux
Self-reported
91.0%
MultiPL-E
Self-reported
88.0%
IFEval
Self-reported
88.0%
WritingBench
Self-reported
87.0%
Creative Writing v3
Self-reported
85.0%
Arena-Hard v2
Evaluation GPT-4.1 by win rateSelf-reported
83.0%

License & Metadata

License
apache-2.0
Announcement Date
September 9, 2025
Last Updated
February 12, 2026

Compare Qwen3-Next-80B-A3B-Instruct

All comparisons

Similar Models

All Models

Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.