Name: Claude Sonnet 5: 82.1% SWE-bench & $3/M Pricing (2026)
Brand: Anthropic
Price: 3.00 USD
Availability: InStock

Question 1

What is Claude Sonnet 5 and who built it?

Accepted Answer

Claude Sonnet 5 is Anthropic's fifth-generation Sonnet-tier large language model, released June 30, 2026, roughly nine months after Sonnet 4.5 and three months after Sonnet 4.6. It is a dense, hybrid-reasoning Transformer with adaptive thinking on by default; Anthropic has not disclosed a parameter count. Anthropic describes it as "the best combination of speed and intelligence" in its lineup, designed to bring near-Opus agentic coding and computer-use performance down to Sonnet's price point. It scores 82.1% on SWE-bench Verified, the first model to break 80% on that benchmark, and 96.2% on GPQA Diamond. It sits below Claude Opus 4.8 and above Claude Haiku 4.5 in Anthropic's current lineup, and was built specifically to beat GPT-5.4 and Gemini 3.1 Pro on agentic coding and computer-use tasks. It launched with a 1M-token context window at $3 input / $15 output per 1M tokens.

Question 2

How much does Claude Sonnet 5 cost per 1M tokens?

Accepted Answer

Claude Sonnet 5 costs $3.00 per 1M input tokens and $15.00 per 1M output tokens at standard pricing, identical to Sonnet 4.6's rate card. Anthropic is running introductory pricing of $2.00 input / $10.00 output per 1M tokens through August 31, 2026 to offset the new tokenizer's roughly 30% higher token counts, after which standard pricing applies. Prompt cache reads run at 0.1x the input rate ($0.30 per 1M at standard pricing). The Batch API gets a 50% discount ($1.50/$7.50 per 1M) but is excluded from Zero Data Retention and retains data up to 29 days. A daily agentic coding loop processing 1M input and 200K output tokens costs roughly $6.00 at standard pricing. This is 40% cheaper than Claude Opus 4.8's $5/$25 per 1M on both input and output. There is no self-hosting option since the model is proprietary and API-only.

Question 3

What is Claude Sonnet 5's context window and max output?

Accepted Answer

Claude Sonnet 5 has a 1,000,000 token context window, matching Claude Opus 4.8 and shared across Anthropic's current top-tier models. Max output on the synchronous Messages API is 128,000 tokens, and the Message Batches API supports up to 300,000 output tokens using the output-300k-2026-03-24 beta header. The model uses the same new tokenizer introduced with Opus 4.7, which produces roughly 30% more tokens for identical input text than Sonnet 4.6's tokenizer, so raw token counts from the prior generation should be recounted rather than reused. This is a substantial jump from Sonnet 4.5's 200K context window. Long-context recall is reported as strong given the model's high scores on agentic, multi-step benchmarks that depend on maintaining state across long tool-use sessions. Document handling covers PDF and image input alongside text within that same 1M budget.

Question 4

How does Claude Sonnet 5 compare on benchmarks vs GPT-5.4 and Gemini 3.1 Pro?

Accepted Answer

Claude Sonnet 5 scores 82.1% on SWE-bench Verified, ahead of Gemini 3.1 Pro (80.6%) and GPT-5.4 (approximately 80%), making it the first model to clear the 80% mark on that benchmark. On the harder SWE-bench Pro variant it scores 63.2%, up from Sonnet 4.6's 58.1%, though Claude Opus 4.8 still leads at 69.2% on that specific test. Sonnet 5 posts 96.2% on GPQA Diamond and 84.7% on ARC-AGI-2, both reported as leading its comparison set. On the Artificial Analysis Intelligence Index v4.1, it scores 53 at max effort, placing fifth overall behind Claude Fable 5 (60), Opus 4.8 (56), GPT-5.5 xhigh (55), and Opus 4.7 (54). The practical takeaway of the SWE-bench gap is that Sonnet 5 now closes most of the coding-quality gap to Opus-tier models while costing 40% less per token. Anthropic has not published an MMLU-Pro or AIME 2025 score for Sonnet 5 specifically.

Question 5

Is Claude Sonnet 5 open source or proprietary?

Accepted Answer

Claude Sonnet 5 is fully proprietary, API-only, with no downloadable weights and no open license of any kind. It is accessible through the direct Anthropic Claude API (model ID claude-sonnet-5), Claude Platform on AWS, Amazon Bedrock (anthropic.claude-sonnet-5, both global and regional endpoints), Microsoft Foundry on Azure, and Google Vertex AI, which Anthropic listed as coming soon at launch. It also ships inside Claude Code and became generally available for GitHub Copilot on release day. Anthropic's commercial terms govern all usage; there is no research-only or non-commercial carve-out. Because the weights are closed, self-hosting, air-gapped deployment, and fine-tuning the base model are not available, unlike open-weights models such as Llama 4 or Qwen3.

Question 6

What modalities does Claude Sonnet 5 support?

Accepted Answer

Claude Sonnet 5 accepts text, image, and PDF input and produces text and tool-call output; it has no native audio or video input or output in either direction. It supports native function calling and tool use, structured JSON outputs, prompt caching, web search, and computer use (screen-reading and control for desktop automation). Computer-use performance jumped sharply versus Sonnet 4.6: 81.2% on OSWorld-Verified (up from 78.5%) and 80.4% on Terminal-Bench 2.1 (up from 67.0%). Adaptive thinking is on by default for every request rather than an opt-in extended-thinking mode, a change from Sonnet 4.6's behavior. Teams building voice or video applications will need to pair Sonnet 5 with a separate ASR/TTS or video model, since neither modality is natively supported.

Question 7

Does Claude Sonnet 5 train on user data?

Accepted Answer

No. Anthropic does not train on API inputs or outputs by default. Standard API data is retained according to Anthropic's normal retention policy, and Zero Data Retention is available for eligible customers, meaning inputs are not stored at rest after the response is returned except where required by law or to combat misuse. The Message Batches API is the one exception: it is not eligible for Zero Data Retention and retains request and response data for up to 29 days. Anthropic holds SOC 2 Type I and Type II, ISO 27001:2022, and ISO/IEC 42001:2023 certifications, and signs HIPAA Business Associate Agreements covering the first-party API and Enterprise plans for organizations handling protected health information. The EU AI Act classifies Sonnet 5 as a general-purpose AI model with systemic risk obligations given its capability level and reach.

Question 8

Who is Claude Sonnet 5 best for and who should avoid it?

Accepted Answer

Claude Sonnet 5 is best for teams running high-volume agentic coding loops that need near-Opus SWE-bench performance without Opus pricing, teams automating computer-use and terminal workflows given its jump to 81.2% OSWorld-Verified and 80.4% Terminal-Bench 2.1, and enterprise teams that need SOC 2, HIPAA-ready, and Zero Data Retention options at Sonnet-tier cost. It also suits anyone migrating off Sonnet 4.6 who wants a same-price upgrade with a real capability jump. It is a poor fit for voice or video-first products since it has no native audio or video I/O, for teams reusing exact Sonnet 4.6 sampling configs since non-default temperature, top_p, or top_k now return a 400 error, and for dangerous-capability-adjacent cyber research, where Anthropic explicitly states Sonnet 5 underperforms Claude Mythos 5 by design. Teams needing the absolute highest coding ceiling on SWE-bench Pro should still consider Opus 4.8, which leads that specific benchmark at 69.2% versus Sonnet 5's 63.2%.

Claude Sonnet 5: 82.1% SWE-bench & $3/M Pricing (2026)

About Claude Sonnet 5

Pricing

Key Features

Pros

Cons

Benchmarks

Frequently Asked Questions