Anthropic Releases Claude Opus 4.7: A Major Upgrade for Agentic Coding, High-Resolution Vision, and Long-Horizon Autonomous Tasks

Anthropic has launched Claude Opus 4.7, it’s newest frontier mannequin and a direct successor to Claude Opus 4.6. The launch is positioned as a centered enchancment moderately than a full generational leap, however the good points it delivers are substantial within the areas that matter most to builders constructing real-world AI-powered functions: agentic software program engineering, multimodal reasoning, and long-running autonomous process execution.

https://www.anthropic.com/information/claude-opus-4-7

What Exactly is Claude Opus 4.7?

Anthropic maintains a mannequin household with tiers — Haiku (quick and light-weight), Sonnet (balanced), and Opus (highest functionality). Opus 4.7 sits on the prime of this stack, under solely the newly previewed Claude Mythos, which Anthropic has stored in a restricted launch.

Opus 4.7 represents a notable enchancment on Opus 4.6 in superior software program engineering, with specific good points on essentially the most tough duties. Crucially, customers report with the ability to hand off their hardest coding work — the sort that beforehand wanted shut supervision — to Opus 4.7 with confidence, because it handles complicated, long-running duties with rigor and consistency, pays exact consideration to directions, and devises methods to confirm its personal outputs earlier than reporting again.

The mannequin verifying its personal outputs is a significant behavioral shift. Earlier fashions usually produced outcomes with out inner sanity checks; Opus 4.7 seems to shut that loop autonomously, which has important implications for CI/CD pipelines and multi-step agentic workflows.

Stronger Coding Benchmarks

Early testers have put some sharp numbers on the coding enhancements. On a 93-task coding benchmark, Opus 4.7 lifted decision by 13% over Opus 4.6, together with 4 duties that neither Opus 4.6 nor Sonnet 4.6 may remedy. On CursorBench — a widely-used developer analysis harness — Opus 4.7 cleared 70% versus Opus 4.6 at 58%. And for complicated multi-step workflows, one tester noticed a 14% achieve over Opus 4.6 at fewer tokens and a 3rd of the software errors — and notably, Opus 4.7 was the primary mannequin to cross their implicit-need exams, persevering with to execute by way of software failures that used to cease Opus chilly.

Improved Vision: 3× the Resolution of Prior Models

One of essentially the most technically concrete upgrades in Opus 4.7 is its multimodal functionality. Opus 4.7 can now settle for photographs as much as 2,576 pixels on the lengthy edge (~3.75 megapixels), greater than thrice as many pixels as prior Claude fashions. Many real-world functions — from computer-use brokers studying dense UI screenshots to information extraction from complicated engineering diagrams — fail not as a result of the mannequin lacks reasoning means, however as a result of it might’t resolve superb visible element. This opens up a wealth of multimodal makes use of that depend upon superb visible element: computer-use brokers studying dense screenshots, information extractions from complicated diagrams, and work that wants pixel-perfect references.

The influence in manufacturing has already been dramatic. One tester engaged on computer-use workflows reported that Opus 4.7 scored 98.5% on their visual-acuity benchmark versus 54.5% for Opus 4.6 — successfully eliminating their single largest Opus ache level.

This is a model-level change moderately than an API parameter, so photographs customers ship to Claude will merely be processed at greater constancy — although as a result of higher-resolution photographs devour extra tokens, customers who don’t require the additional element can downsample photographs earlier than sending them to the mannequin.

https://www.anthropic.com/information/claude-opus-4-7

A New Effort Level: `xhigh`, Plus Task Budgets

Developers working with the Claude API will discover two new levers for controlling compute spend.

First, Opus 4.7 introduces a brand new xhigh (‘further excessive’) effort stage between excessive and max, giving customers finer management over the tradeoff between reasoning and latency on onerous issues. In Claude Code, Anthropic staff has raised the default effort stage to xhigh for all plans. When testing Opus 4.7 for coding and agentic use circumstances, Anthropic recommends beginning with excessive or xhigh effort.
Second, process budgets are actually launching in public beta on the Claude Platform API, giving builders a strategy to information Claude’s token spend so it might prioritize work throughout longer runs. Together, these two controls give developer groups significant manufacturing levers — particularly related when operating parallelized agent pipelines the place per-call value and latency have to be managed fastidiously.

New in Claude Code: `/ultrareview` and Auto Mode for Max Users

Two new Claude Code options ship alongside Opus 4.7 which are price flagging for devs who use it as a part of their growth workflow. The new /ultrareview slash command produces a devoted evaluation session that reads by way of adjustments and flags bugs and design points {that a} cautious reviewer would catch. Anthropic is giving Pro and Max Claude Code customers three free ultrareviews to attempt it out. Think of it as a senior engineer evaluation cross on demand — helpful earlier than merging complicated PRs or transport to manufacturing.

Additionally, auto mode has been prolonged to Max customers. Auto mode is a brand new permissions possibility the place Claude makes choices in your behalf, which means you can run longer duties with fewer interruptions — and with much less danger than when you had chosen to skip all permissions. This is especially beneficial for brokers executing multi-step duties in a single day or throughout massive codebases.

File System-Based Memory for Long Multi-Session Work

A less-discussed however operationally important enchancment is how Opus 4.7 handles reminiscence. Opus 4.7 is best at utilizing file system-based reminiscence — it remembers necessary notes throughout lengthy, multi-session work and makes use of them to maneuver on to new duties that, consequently, want much less up-front context. On third-party benchmarks, the mannequin additionally achieved state-of-the-art outcomes on GDPval-AA, a third-party analysis of economically beneficial information work throughout finance, authorized, and different domains.

Key Takeaways

Claude Opus 4.7 is Anthropic’s strongest coding mannequin thus far, dealing with complicated, long-running agentic duties with far much less supervision than Opus 4.6 — and uniquely verifies its personal outputs earlier than reporting again.
Vision functionality has tripled, with help for photographs as much as ~3.75 megapixels, making it considerably extra dependable for computer-use brokers, diagram parsing, and any workflow that will depend on superb visible element.
A new xhigh effort stage and process budgets give builders exact management over the reasoning-vs-latency tradeoff and token spend — essential levers for operating cost-efficient multi-step agent pipelines in manufacturing.
Two main Claude Code options ship alongside the mannequin: the /ultrareview slash command for on-demand deep code evaluation, and auto mode — now prolonged to Max customers — which lets brokers run longer duties with fewer interruptions.

Check out the Technical details here. Also, be happy to comply with us on Twitter and don’t overlook to affix our 130k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to associate with us for selling your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar and so forth.? Connect with us

The submit Anthropic Releases Claude Opus 4.7: A Major Upgrade for Agentic Coding, High-Resolution Vision, and Long-Horizon Autonomous Tasks appeared first on MarkTechPost.

Anthropic Releases Claude Opus 4.7: A Major Upgrade for Agentic Coding, High-Resolution Vision, and Long-Horizon Autonomous Tasks

What Exactly is Claude Opus 4.7?

Stronger Coding Benchmarks

Improved Vision: 3× the Resolution of Prior Models

A New Effort Level: `xhigh`, Plus Task Budgets

New in Claude Code: `/ultrareview` and Auto Mode for Max Users

File System-Based Memory for Long Multi-Session Work

Key Takeaways

Google Cloud Introduces Open Knowledge Format (OKF): A Vendor-Neutral Markdown Spec for Giving AI Agents Curated Context

The universal tool calling protocol for agentic AI

DualDistill and Agentic-R1: How AI Combines Natural Language and Tool Use for Superior Math Problem Solving

Top 10 Local LLMs (2025): Context Windows, VRAM Targets, and Licenses Compared

Stochastic Gradient Descent (SGD’s) Frequency Bias and How Adam Fixes It

6 things to fix before RLHF turns your biases into features

Curated by experts. Filtered for relevance.

Resources

About

Subscribe & learn more every day!

What Exactly is Claude Opus 4.7?

Stronger Coding Benchmarks

Improved Vision: 3× the Resolution of Prior Models

A New Effort Level: xhigh, Plus Task Budgets

New in Claude Code: /ultrareview and Auto Mode for Max Users

File System-Based Memory for Long Multi-Session Work

Key Takeaways

Similar Posts

Curated by experts. Filtered for relevance.

Resources

About

Subscribe & learn more every day!

A New Effort Level: `xhigh`, Plus Task Budgets

New in Claude Code: `/ultrareview` and Auto Mode for Max Users