Google logo

Gemini 3 Flash

Multimodal
Google

Gemini 3 Flash offers frontier-level intelligence built for speed at a fraction of the cost. Combines the reasoning quality of Gemini 3 Pro with Flash-level latency, efficiency, and cost. Features a 1 million token input context window and is optimized for agentic workflows, coding, and complex analysis.

Key Specifications

Parameters
-
Context
1.0M
Release Date
December 16, 2025
Average Score
94.0%

Timeline

Key dates in the model's history
Announcement / Last Update
December 16, 2025
Today
March 25, 2026

Technical Specifications

Parameters
-
Training Tokens
-
Knowledge Cutoff
January 1, 2025
Family
-
Capabilities
MultimodalZeroEval

Pricing & Availability

Input (per 1M tokens)
$0.50
Output (per 1M tokens)
$3.00
Max Input Tokens
1.0M
Max Output Tokens
65.5K
Supported Features
Function CallingStructured OutputCode ExecutionWeb SearchBatch InferenceFine-tuning

Benchmark Results

Model performance metrics across various tests and benchmarks

Reasoning

Logical reasoning and analysis
GPQA
Without toolsSelf-reported
90.0%

Other Tests

Specialized benchmarks
AIME 2025
With codeSelf-reported
100.0%
MMMLU
Self-reported
92.0%

License & Metadata

License
proprietary
Announcement Date
December 16, 2025
Last Updated
December 16, 2025

Compare Gemini 3 Flash

All comparisons

Articles about Gemini 3 Flash

Similar Models

All Models

Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.