Nemotron 3 Super (120B A12B)
Nemotron 3 Super (120B A12B) is a Mixture-of-Experts (MoE) model from NVIDIA with 120 billion total parameters, of which 12 billion are active. Because each forward pass uses only about 10% of the total parameters, the model combines strong results in reasoning, coding, and knowledge tasks with a much lower compute cost than a dense model of the same size.
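As a rough illustration of what "12 billion active out of 120 billion" means, the sketch below works through the arithmetic. The parameter counts come from the description above; the function names are hypothetical, and the 2-FLOPs-per-active-parameter figure is a common rule of thumb that ignores attention cost, not a number published by NVIDIA.

```python
# Parameter counts taken from the model description above.
TOTAL_PARAMS = 120e9   # all experts combined
ACTIVE_PARAMS = 12e9   # parameters actually used in one forward pass


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters used per forward pass."""
    if total <= 0 or not 0 <= active <= total:
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active / total


def approx_forward_flops_per_token(active: float) -> float:
    """Rule-of-thumb estimate (an assumption): ~2 FLOPs per active parameter."""
    return 2.0 * active


if __name__ == "__main__":
    share = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    moe_flops = approx_forward_flops_per_token(ACTIVE_PARAMS)
    dense_flops = approx_forward_flops_per_token(TOTAL_PARAMS)
    print(f"Active share per forward pass: {share:.0%}")
    print(f"Approx. FLOPs/token (MoE):   {moe_flops:.2e}")
    print(f"Approx. FLOPs/token (dense): {dense_flops:.2e}")
```

Under these assumptions the per-token compute is about one tenth of a dense 120B model, although all 120B parameters still have to be held in memory.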
Key Specifications
Parameters: 120.0B
Context: -
Release Date: March 1, 2026
Average Score: 88.5%
Timeline
Key dates in the model's history
Announcement: March 1, 2026
Last Update: March 21, 2026
Today: March 26, 2026
Technical Specifications
Parameters: 120.0B
Training Tokens: -
Knowledge Cutoff: -
Family: -
Capabilities
Multimodal, ZeroEval
Benchmark Results
Model performance metrics across various tests and benchmarks
Reasoning
Logical reasoning and analysis
GPQA: With tools • Self-reported
Other Tests
Specialized benchmarks
HMMT 2025: With tools • Self-reported
RULER: 100 @ 1M • Self-reported
AIME 2025: Without tools • Self-reported
WMT24++: en→xx • Self-reported
MMLU-Pro: - • Self-reported
License & Metadata
License: nvidia-open-model
Announcement Date: March 1, 2026
Last Updated: March 21, 2026
Similar Models
Llama 3.1 Nemotron Ultra 253B v1
Developer: NVIDIA
Parameters: 253.0B
Best score: 0.8 (GPQA)
Released: Apr 2025
Nemotron 3 Nano (30B A3B)
Developer: NVIDIA
Parameters: 32.0B
Best score: 0.8 (GPQA)
Released: Dec 2025
Price: $0.06/1M tokens
Llama 3.1 Nemotron 70B Instruct
Developer: NVIDIA
Parameters: 70.0B
Best score: 0.8 (MMLU)
Released: Oct 2024
DeepSeek-V3.2-Exp
Developer: DeepSeek
Parameters: 685.0B
Best score: 0.8 (GPQA)
Released: Sep 2025
Price: $0.27/1M tokens
GLM-4.5
Developer: Zhipu AI
Parameters: 355.0B
Best score: 0.8 (GPQA)
Released: Jul 2025
Price: $0.60/1M tokens
DeepSeek-V3.1
Developer: DeepSeek
Parameters: 671.0B
Best score: 0.8 (GPQA)
Released: Aug 2025
Price: $0.27/1M tokens
MiniMax M2
Developer: MiniMax
Parameters: 230.0B
Best score: 0.8 (GPQA)
Released: Oct 2025
Price: $1.00/1M tokens
Llama 3.1 405B Instruct
Developer: Meta
Parameters: 405.0B
Best score: 1.0 (ARC)
Released: Jul 2024
Price: $3.50/1M tokens
Recommendations are based on similarity in developer organization, multimodality, parameter size, and benchmark performance.
