Claude Sonnet 4.6

Name: Claude Sonnet 4.6
Author: Anthropic

Multimodal

Anthropic

Claude Sonnet 4.6 is a complete upgrade to the Sonnet-class model with improvements in coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Users preferred Sonnet 4.6 over Sonnet 4.5 approximately 70% of the time. The first Sonnet-class model with a 1M token context window (beta) and context compaction. Significant improvement in computer use skills compared to previous Sonnet models.

Key Specifications

Parameters

Context

200.0K

Release Date

February 17, 2026

Average Score

73.6%

API Documentation Results Blog

Timeline

Key dates in the model's history

Announcement

February 17, 2026

Last Update

February 20, 2026

Today

July 7, 2026

Technical Specifications

Parameters

Training Tokens

Knowledge Cutoff

Family

Capabilities

MultimodalZeroEval

Pricing & Availability

Input (per 1M tokens)

$3.00

Output (per 1M tokens)

$15.00

Max Input Tokens

200.0K

Max Output Tokens

64.0K

Supported Features

Function CallingStructured OutputCode ExecutionWeb SearchBatch InferenceFine-tuning

Benchmark Results

Model performance metrics across various tests and benchmarks

Programming

Programming skills tests

SWE-bench Verified

SWE-bench Verified — benchmark for evaluation abilities model solve real tasks from GitHub- • Self-reported

79.6%

Reasoning

Logical reasoning and analysis

GPQA

GPQA Diamond — benchmark for evaluation abilities model answer on questions level PhD by and • Self-reported

89.9%

Other Tests

Specialized benchmarks

ARC-AGI v2

ARC-AGI v2 — benchmark for evaluation abilities to reasoning and generalization • Self-reported

58.3%

MMMLU

MMMLU — version MMLU for evaluation knowledge model on languages • Self-reported

89.3%

CharXiv-R

CharXiv-R — benchmark for evaluation abilities model understand and reason about and • Self-reported

74.7%

MMMU-Pro

MMMU-Pro — version MMMU for evaluation on level experts • Self-reported

75.6%

HLE

HLE (Humanity's Last Exam) — benchmark from questions, experts for verification knowledge AI • Self-reported

49.0%

SimpleQA

SimpleQA — benchmark for evaluation actual accuracy answers model on simple questions. • Self-reported

72.5%

License & Metadata

License

proprietary

Announcement Date

February 17, 2026

Last Updated

February 20, 2026

Articles about Claude Sonnet 4.6

Claude Can Now Control Your Computer. Anthropic Says Trust It — Mostly.

Anthropic shipped computer use for Claude Code and Cowork — mouse, keyboard, browser, files. Plus a new Auto Mode that skips permission prompts. macOS only.

March 26, 2026

7 min

Similar Models

All Models

Claude Opus 4.6

Anthropic

Best score:1.0 (TAU)

Released:Feb 2026

Price:$5.00/1M tokens

Claude Sonnet 4.5

Anthropic

Best score:0.9 (TAU)

Released:Sep 2025

Price:$3.00/1M tokens

Claude Opus 4.1

Anthropic

Best score:0.8 (TAU)

Released:Aug 2025

Price:$15.00/1M tokens

Claude Opus 4.5

Anthropic

Best score:0.9 (TAU)

Released:Nov 2025

Price:$5.00/1M tokens

Claude 3 Opus

Anthropic

Best score:1.0 (ARC)

Released:Feb 2024

Price:$15.00/1M tokens

Claude 3.7 Sonnet

Anthropic

Best score:0.8 (GPQA)

Released:Feb 2025

Price:$3.00/1M tokens

Claude 3 Sonnet

Anthropic

Best score:0.9 (ARC)

Released:Feb 2024

Price:$3.00/1M tokens

Claude 3.5 Sonnet

Anthropic

Best score:0.9 (HumanEval)

Released:Oct 2024

Price:$3.00/1M tokens

Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.