Qwen3 VL 32B Thinking
Multimodal
Qwen3-VL is a large multimodal model that combines vision, language, and reasoning to achieve human-level perception and cognitive abilities. The 33B-parameter Thinking version leads in multimodal reasoning and STEM tasks, with OCR support, video understanding, and agentic interaction.
Key Specifications
Parameters
33.0B
Context
-
Release Date
September 21, 2025
Average Score
14450.8%
Timeline
Key dates in the model's history
Announcement
September 21, 2025
Last Update
February 17, 2026
Technical Specifications
Parameters
33.0B
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
Multimodal
ZeroEval
Benchmark Results
Model performance metrics across various tests and benchmarks
Other Tests
Specialized benchmarks
OCRBench
• Self-reported
MM-MT-Bench
• Self-reported
DocVQA (test)
• Self-reported
ScreenSpot
• Self-reported
MMLU-Redux
• Self-reported
MMBench-V1.1 (EN_V1.1)
• Self-reported
License & Metadata
License
apache-2.0
Similar Models
Qwen2-VL-72B-Instruct
Alibaba
MM 73.4B
Released: Aug 2024
Qwen2.5 VL 72B Instruct
Alibaba
MM 72.0B
Released: Jan 2025
Qwen2.5 VL 32B Instruct
Alibaba
MM 33.5B
Best score: 0.9 (HumanEval)
Released: Feb 2025
QvQ-72B-Preview
Alibaba
MM 73.4B
Released: Dec 2024
Qwen3.5-397B-A17B
Alibaba
MM 397.0B
Released: Feb 2026
Qwen2.5 VL 7B Instruct
Alibaba
MM 8.3B
Released: Jan 2025
Qwen2.5-Omni-7B
Alibaba
MM 7.0B
Best score: 0.8 (HumanEval)
Released: Mar 2025
Qwen3.5 27B
Alibaba
27.0B
Released: Mar 2026
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance.