Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed

Anthropic launched Claude Haiku 4.5, a latency-optimized “small” mannequin that delivers related ranges of coding efficiency to Claude Sonnet 4 whereas working more than twice as quick at one-third the value. The mannequin is straight away out there through Anthropic’s API and in accomplice catalogs on Amazon Bedrock and Google Cloud Vertex AI. Pricing is $1/MTok enter and $5/MTok output. Anthropic positions Haiku 4.5 as a drop-in alternative for Haiku 3.5 and Sonnet 4 in cost-sensitive, interactive workloads.
Positioning and lineup
Haiku 4.5 targets real-time assistants, customer-support automations, and pair-programming the place tight latency budgets and throughput dominate. It surpasses Sonnet 4 on “pc use” duties—the GUI/browser manipulation underpinning merchandise like Claude for Chrome—and is described as materially bettering responsiveness in Claude Code for multi-agent initiatives and fast prototyping. Anthropic makes clear that Sonnet 4.5 stays the frontier mannequin and “the greatest coding mannequin in the world,” whereas Haiku 4.5 affords near-frontier efficiency with larger cost-efficiency. A really useful sample is Sonnet 4.5 for multi-step planning and parallel execution by a pool of Haiku 4.5 employees.
Availability, identifiers, and pricing
From day one, builders can name the mannequin (claude-haiku-4-5
) on Anthropic’s API. Anthropic additionally states availability on Amazon Bedrock and Vertex AI; mannequin catalogs might replace area protection and IDs over time, however the firm confirms cloud availability in the launch publish. The API worth for Haiku 4.5 is $1/MTok (enter) and $5/MTok (output), with prompt-caching listed at $1.25/MTok write and $0.10/MTok learn.
Benchmarks
Anthropic summarizes outcomes throughout customary and agentic suites and consists of methodology particulars to qualify the numbers:
- SWE-bench Verified: easy scaffold with two instruments (bash, file edits), 73.3% averaged over 50 trials, no test-time compute, 128K pondering finances, default sampling. Includes a minor immediate addendum encouraging intensive instrument use and writing exams first.
- Terminal-Bench: Terminus-2 agent, common over 11 runs (6 with out pondering, 5 with 32K pondering finances).
- OSWorld-Verified: 100 max steps, averaged throughout 4 runs with a 128K complete pondering finances and 2K per-step configuration.
- AIME / MMMLU: averages over a number of runs utilizing default sampling and 128K pondering budgets.


The publish emphasizes coding parity with Sonnet 4 and computer-use features relative to Sonnet 4 beneath these scaffolds. Users ought to replicate with their very own orchestration, instrument stacks, and pondering budgets earlier than generalizing.
Key Takeaways
- Haiku 4.5 delivers Sonnet-4-level coding efficiency at one-third the value and more than twice the pace.
- It surpasses Sonnet 4 on computer-use duties, bettering responsiveness in Claude for Chrome and multi-agent flows in Claude Code.
- Recommended orchestration: use Sonnet 4.5 for multi-step planning and parallelize execution with a number of Haiku 4.5 employees.
- Pricing is $1/$5 per million enter/output tokens; out there through Claude API, Amazon Bedrock, and Google Cloud Vertex AI.
- Released beneath ASL-2 with a decrease measured misalignment charge than Sonnet 4.5 and Opus 4.1 in Anthropic’s exams.
Editorial Comments
Anthropic’s positioning of Claude Haiku 4.5 is strategically sound: by delivering related ranges of coding efficiency to Claude Sonnet 4 at one-third the value and more than twice the pace, whereas surpassing Sonnet 4 on pc use, the firm offers devs a clear planner–executor cut up—Sonnet 4.5 for multi-step planning and a pool of Haiku 4.5 employees for parallel execution—with out forcing architectural adjustments (“drop-in alternative” throughout API, Amazon Bedrock, Vertex AI). The ASL-2 launch, coupled with a documented decrease misalignment charge than Sonnet 4.5 and Opus 4.1, lowers the friction for enterprise rollout the place security gates and value envelopes dominate deployment math.
Check out the Technical details, system card, model page, and documentation . Feel free to take a look at our GitHub Page for Tutorials, Codes and Notebooks. Also, be at liberty to observe us on Twitter and don’t overlook to affix our 100k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.
The publish Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed appeared first on MarkTechPost.