DeepSeek logo

DeepSeek VL2 Small

Multimodal
DeepSeek

An advanced series of large multimodal Mixture-of-Experts (MoE) Vision-Language models that significantly surpasses its predecessor DeepSeek-VL. DeepSeek-VL2 demonstrates superior capabilities across various tasks including but not limited to visual question answering, optical character recognition, document/table/chart understanding, and visual grounding.

Key Specifications

Parameters
16.0B
Context
-
Release Date
December 13, 2024
Average Score
69.6%

Timeline

Key dates in the model's history
Announcement
December 13, 2024
Last Update
July 19, 2025
Today
March 25, 2026

Technical Specifications

Parameters
16.0B
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
MultimodalZeroEval

Benchmark Results

Model performance metrics across various tests and benchmarks

Multimodal

Working with images and visual data
AI2D
testSelf-reported
80.0%
ChartQA
testSelf-reported
84.5%
DocVQA
testSelf-reported
92.3%
MathVista
testminiSelf-reported
60.7%
MMMU
valSelf-reported
48.0%

Other Tests

Specialized benchmarks
InfoVQA
testSelf-reported
75.8%
MMBench
ru testSelf-reported
80.3%
MMBench-V1.1
cn testSelf-reported
79.3%
MME
Standard Evaluation AI: Standard evaluationSelf-reported
21.2%
MMStar
Standard evaluation AI: Standard evaluationSelf-reported
57.0%
MMT-Bench
Standard evaluation AI: I'm an AI assistant that answers questions.Self-reported
62.9%
OCRBench
Standard evaluation AI: Standard evaluationSelf-reported
83.4%
RealWorldQA
Standard evaluation AI: ChatGPT assisted solving math problems Math problems are a significant challenge for state-of-the-art LLMs. This project studies how LLMs solve math problems. We explore direct solving and chain-of-thought (CoT) prompting, aiming to understand and improve solution approaches. Methods: 1. Direct Solving: We give the model a question and ask for an answer. 2. Chain-of-Thought (CoT): We instruct the model to break down the problem into steps. We study: - Problem solving approach (structured vs. unstructured reasoning) - Common error patterns - Reasoning path analysis - Impact of formula knowledgeSelf-reported
65.4%
TextVQA
valSelf-reported
83.4%

License & Metadata

License
deepseek
Announcement Date
December 13, 2024
Last Updated
July 19, 2025

Similar Models

All Models

Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.