Fastest AI Models
Top AI models by generation speed — maximum throughput via API.
| # | Model | Score |
|---|---|---|
| 1 | GPT OSS 20B | 37.8% |
| 2 | GPT-5 nano | 50.1% |
| 3 | GPT OSS 120B | 45.6% |
| 4 | Grok-4.1 Fast Non-Reasoning | — |
| 5 | Step-3.5-Flash | 77.6% |
| 6 | Grok-4 Fast Reasoning | — |
| 7 | Qwen3 32B | 75.3% |
| 8 | GPT-5.4 mini | 82.2% |
| 9 | Gemini 3 Flash | 94.0% |
| 10 | Claude Opus 4.5 | 87.8% |
| 11 | GPT-5.4 nano | 74.5% |
| 12 | Grok-4.1 Fast Reasoning | — |
| 13 | GPT-5.1 Codex | 64.3% |
| 14 | GPT-5.1 Instant | 87.6% |
| 15 | GPT-5 mini | 60.0% |
Want a detailed comparison of the leaders? → GPT OSS 20B vs GPT-5 nano
Other Collections
Coding
Top AI models for writing code, debugging, and solving real-world development tasks.
Math
Top AI models for solving mathematical problems and computations.
Reasoning
Top AI models for reasoning tasks, analysis, and expert knowledge.
Cheap
Top AI models with the lowest API prices — maximum quality for minimum cost.
Multimodal
Top AI models with support for text, images, audio, and video.
Large Context
Top AI models with the largest context window — for working with long documents.
General Knowledge
Top AI models on the MMLU benchmark — broad knowledge across dozens of subjects.
Function Calling
Top AI models for function calling and agentic tool use.
Image Analysis
Top AI models for analyzing charts, diagrams, and visual data.
Free
AI models with free API access — open source and free-tier providers.
AI Agents
Top AI models for building autonomous agents with tool use and planning.
Ranking updated in 2026 based on official benchmarks and independent tests. See also model comparisons and the full catalog.