Anthropic logo

Claude Sonnet 4.6

Multimodal
Anthropic

Claude Sonnet 4.6 is a complete upgrade to the Sonnet-class model with improvements in coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Users preferred Sonnet 4.6 over Sonnet 4.5 approximately 70% of the time. The first Sonnet-class model with a 1M token context window (beta) and context compaction. Significant improvement in computer use skills compared to previous Sonnet models.

Key Specifications

Parameters
-
Context
200.0K
Release Date
February 17, 2026
Average Score
73.6%

Timeline

Key dates in the model's history
Announcement
February 17, 2026
Last Update
February 20, 2026
Today
May 9, 2026

Technical Specifications

Parameters
-
Training Tokens
-
Knowledge Cutoff
-
Family
-
Capabilities
MultimodalZeroEval

Pricing & Availability

Input (per 1M tokens)
$3.00
Output (per 1M tokens)
$15.00
Max Input Tokens
200.0K
Max Output Tokens
64.0K
Supported Features
Function CallingStructured OutputCode ExecutionWeb SearchBatch InferenceFine-tuning

Benchmark Results

Model performance metrics across various tests and benchmarks

Programming

Programming skills tests
SWE-bench Verified
SWE-bench Verified — benchmark for evaluation abilities model solve real tasks from GitHub-Self-reported
79.6%

Reasoning

Logical reasoning and analysis
GPQA
GPQA Diamond — benchmark for evaluation abilities model answer on questions level PhD by andSelf-reported
89.9%

Other Tests

Specialized benchmarks
ARC-AGI v2
ARC-AGI v2 — benchmark for evaluation abilities to reasoning and generalizationSelf-reported
58.3%
MMMLU
MMMLU — version MMLU for evaluation knowledge model on languagesSelf-reported
89.3%
CharXiv-R
CharXiv-R — benchmark for evaluation abilities model understand and reason about andSelf-reported
74.7%
MMMU-Pro
MMMU-Pro — version MMMU for evaluation on level expertsSelf-reported
75.6%
HLE
HLE (Humanity's Last Exam) — benchmark from questions, experts for verification knowledge AISelf-reported
49.0%
SimpleQA
SimpleQA — benchmark for evaluation actual accuracy answers model on simple questions.Self-reported
72.5%

License & Metadata

License
proprietary
Announcement Date
February 17, 2026
Last Updated
February 20, 2026

Compare Claude Sonnet 4.6

All comparisons

Articles about Claude Sonnet 4.6

Similar Models

All Models

Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.