Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed

Anthropic launched Claude Haiku 4.5, a latency-optimized “small” mannequin that delivers related ranges of coding efficiency to Claude Sonnet 4 whereas working more than twice as quick at one-third the value. The mannequin is straight away out there through Anthropic’s API and in accomplice catalogs on Amazon Bedrock and Google Cloud Vertex AI. Pricing is $1/MTok enter and $5/MTok output. Anthropic positions Haiku 4.5 as a drop-in alternative for Haiku 3.5 and Sonnet 4 in cost-sensitive, interactive workloads.

Positioning and lineup

Haiku 4.5 targets real-time assistants, customer-support automations, and pair-programming the place tight latency budgets and throughput dominate. It surpasses Sonnet 4 on “pc use” duties—the GUI/browser manipulation underpinning merchandise like Claude for Chrome—and is described as materially bettering responsiveness in Claude Code for multi-agent initiatives and fast prototyping. Anthropic makes clear that Sonnet 4.5 stays the frontier mannequin and “the greatest coding mannequin in the world,” whereas Haiku 4.5 affords near-frontier efficiency with larger cost-efficiency. A really useful sample is Sonnet 4.5 for multi-step planning and parallel execution by a pool of Haiku 4.5 employees.

Availability, identifiers, and pricing

From day one, builders can name the mannequin (claude-haiku-4-5) on Anthropic’s API. Anthropic additionally states availability on Amazon Bedrock and Vertex AI; mannequin catalogs might replace area protection and IDs over time, however the firm confirms cloud availability in the launch publish. The API worth for Haiku 4.5 is $1/MTok (enter) and $5/MTok (output), with prompt-caching listed at $1.25/MTok write and $0.10/MTok learn.

Benchmarks

Anthropic summarizes outcomes throughout customary and agentic suites and consists of methodology particulars to qualify the numbers:

SWE-bench Verified: easy scaffold with two instruments (bash, file edits), 73.3% averaged over 50 trials, no test-time compute, 128K pondering finances, default sampling. Includes a minor immediate addendum encouraging intensive instrument use and writing exams first.
Terminal-Bench: Terminus-2 agent, common over 11 runs (6 with out pondering, 5 with 32K pondering finances).
OSWorld-Verified: 100 max steps, averaged throughout 4 runs with a 128K complete pondering finances and 2K per-step configuration.
AIME / MMMLU: averages over a number of runs utilizing default sampling and 128K pondering budgets.

https://www.anthropic.com/information/claude-haiku-4-5

The publish emphasizes coding parity with Sonnet 4 and computer-use features relative to Sonnet 4 beneath these scaffolds. Users ought to replicate with their very own orchestration, instrument stacks, and pondering budgets earlier than generalizing.

Key Takeaways

Haiku 4.5 delivers Sonnet-4-level coding efficiency at one-third the value and more than twice the pace.
It surpasses Sonnet 4 on computer-use duties, bettering responsiveness in Claude for Chrome and multi-agent flows in Claude Code.
Recommended orchestration: use Sonnet 4.5 for multi-step planning and parallelize execution with a number of Haiku 4.5 employees.
Pricing is $1/$5 per million enter/output tokens; out there through Claude API, Amazon Bedrock, and Google Cloud Vertex AI.
Released beneath ASL-2 with a decrease measured misalignment charge than Sonnet 4.5 and Opus 4.1 in Anthropic’s exams.

Editorial Comments

Anthropic’s positioning of Claude Haiku 4.5 is strategically sound: by delivering related ranges of coding efficiency to Claude Sonnet 4 at one-third the value and more than twice the pace, whereas surpassing Sonnet 4 on pc use, the firm offers devs a clear planner–executor cut up—Sonnet 4.5 for multi-step planning and a pool of Haiku 4.5 employees for parallel execution—with out forcing architectural adjustments (“drop-in alternative” throughout API, Amazon Bedrock, Vertex AI). The ASL-2 launch, coupled with a documented decrease misalignment charge than Sonnet 4.5 and Opus 4.1, lowers the friction for enterprise rollout the place security gates and value envelopes dominate deployment math.