Nemotron 3 Super (120B A12B)

NVIDIA

Nemotron-3 Super 120B-A12B is a Mixture-of-Experts model from NVIDIA with 120 billion total parameters, of which 12 billion are active per forward pass. Because only about a tenth of the parameters are used at a time, it balances high performance with computational efficiency, and it reports strong results in reasoning, coding, and knowledge tasks.
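The "fraction of the total parameters" can be made concrete with a short calculation. The sketch below uses only the two parameter counts quoted in the description; the constant and function names are illustrative and not part of any NVIDIA tooling.

```python
# Parameter counts taken from the model description.
TOTAL_PARAMS = 120e9   # 120 billion total parameters
ACTIVE_PARAMS = 12e9   # 12 billion active parameters per forward pass


def active_fraction(active: float, total: float) -> float:
    """Return the share of parameters used in a single forward pass."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Active share per forward pass: {fraction:.0%}")
```

This covers parameter counts only. It says nothing about memory footprint, since all 120B parameters still have to be stored even though only 12B are used per forward pass.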

Key Specifications

Parameters: 120.0B
Context: -
Release Date: March 1, 2026
Average Score: 88.5%

Timeline

Key dates in the model's history
Announcement: March 1, 2026
Last Update: March 21, 2026

Technical Specifications

Parameters: 120.0B
Training Tokens: -
Knowledge Cutoff: -
Family: -
Capabilities: Multimodal, ZeroEval

Benchmark Results

Model performance metrics across various tests and benchmarks

Reasoning

Logical reasoning and analysis
GPQA (with tools, self-reported): 83.0%

Other Tests

Specialized benchmarks
HMMT 2025 (with tools, self-reported): 95.0%
RULER (100 @ 1M, self-reported): 92.0%
AIME 2025 (without tools, self-reported): 90.0%
WMT24++ (en→xx, self-reported): 87.0%
MMLU-Pro (self-reported): 84.0%
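The Average Score of 88.5% listed under Key Specifications is consistent with an unweighted mean of the six self-reported results in this section. The page does not say how the figure is computed, so treat that as an assumption. A minimal sketch of the check:

```python
from statistics import mean

# Self-reported scores (percent) as listed in the Benchmark Results section.
scores = {
    "GPQA": 83.0,
    "HMMT 2025": 95.0,
    "RULER": 92.0,
    "AIME 2025": 90.0,
    "WMT24++": 87.0,
    "MMLU-Pro": 84.0,
}

# Assumption: every benchmark carries equal weight in the average.
average = mean(list(scores.values()))
print(f"Unweighted mean over {len(scores)} benchmarks: {average:.1f}%")
```

The benchmarks are run under different conditions (with tools, without tools, long context at 1M), so the mean is a rough summary, not a like-for-like measure.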

License & Metadata

License: nvidia-open-model
Announcement Date: March 1, 2026
Last Updated: March 21, 2026
