Claude Sonnet 4.6

Multimodal
Anthropic

Claude Sonnet 4.6 is a complete upgrade to the Sonnet-class model, with improvements in coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Users preferred Sonnet 4.6 over Sonnet 4.5 approximately 70% of the time. It is the first Sonnet-class model with a 1M-token context window (beta) and context compaction, and its computer use skills improve significantly on previous Sonnet models.

Key Specifications

Parameters
-
Context
200.0K
Release Date
February 17, 2026
Average Score
73.6%

Timeline

Key dates in the model's history
Announcement
February 17, 2026
Last Update
February 20, 2026

Technical Specifications

Parameters
-
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
Multimodal, ZeroEval

Pricing & Availability

Input (per 1M tokens)
$3.00
Output (per 1M tokens)
$15.00
Max Input Tokens
200.0K
Max Output Tokens
64.0K
Supported Features
Function Calling, Structured Output, Code Execution, Web Search, Batch Inference, Fine-tuning
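The per-token prices and limits above can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming the flat list rates shown here apply to every token; it ignores any discounts or special pricing tiers, and the function name `estimate_cost_usd` is hypothetical, chosen only for illustration.

```python
# Values copied from the listing above (USD per 1M tokens, token limits).
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00
MAX_INPUT_TOKENS = 200_000
MAX_OUTPUT_TOKENS = 64_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the flat list rates (an assumption)."""
    if not 0 <= input_tokens <= MAX_INPUT_TOKENS:
        raise ValueError("input_tokens is outside the listed input limit")
    if not 0 <= output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("output_tokens is outside the listed output limit")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# A request that uses the full listed input and output limits.
print(f"${estimate_cost_usd(200_000, 64_000):.2f}")  # $1.56
```

At these rates, output tokens cost five times as much as input tokens, so a response that fills the 64K output limit ($0.96) costs more than a prompt that fills the 200K input limit ($0.60).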

Benchmark Results

Model performance metrics across various tests and benchmarks

Programming

Programming skills tests
SWE-bench Verified
SWE-bench Verified: a benchmark that evaluates a model's ability to solve real tasks from GitHub. Self-reported.
79.6%

Reasoning

Logical reasoning and analysis
GPQA
GPQA Diamond: a benchmark that evaluates a model's ability to answer PhD-level questions. Self-reported.
89.9%

Other Tests

Specialized benchmarks
ARC-AGI v2
ARC-AGI v2: a benchmark that evaluates reasoning and generalization ability. Self-reported.
58.3%
MMMLU
MMMLU: a version of MMLU that evaluates a model's knowledge across languages. Self-reported.
89.3%
CharXiv-R
CharXiv-R: a benchmark that evaluates a model's ability to understand and reason about charts. Self-reported.
74.7%
MMMU-Pro
MMMU-Pro: a version of MMMU for expert-level evaluation. Self-reported.
75.6%
HLE
HLE (Humanity's Last Exam): a benchmark of questions written by experts to test AI knowledge. Self-reported.
49.0%
SimpleQA
SimpleQA: a benchmark that evaluates the factual accuracy of a model's answers to simple questions. Self-reported.
72.5%
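
The Average Score of 73.6% listed under Key Specifications is consistent with an unweighted mean of the eight self-reported results above. The page does not state how the figure is derived, so treat the short check below as an inference and not as the catalog's documented method.

```python
from statistics import mean

# Self-reported benchmark scores from this page, in percent.
scores = {
    "SWE-bench Verified": 79.6,
    "GPQA": 89.9,
    "ARC-AGI v2": 58.3,
    "MMMLU": 89.3,
    "CharXiv-R": 74.7,
    "MMMU-Pro": 75.6,
    "HLE": 49.0,
    "SimpleQA": 72.5,
}

# Assumption: every benchmark carries equal weight in the average.
average = mean(scores.values())
print(f"{average:.1f}%")  # 73.6%
```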

License & Metadata

License
proprietary
Announcement Date
February 17, 2026
Last Updated
February 20, 2026
