When was Claude Opus 4.7 released?

Claude Opus 4.7 was released on April 16, 2026 by Anthropic. It is available on Claude.ai (Pro, Max, Team, Enterprise plans), direct API, Amazon Bedrock, Google Cloud Vertex AI and Microsoft Foundry.

How much does Claude Opus 4.7 cost?

Pricing remains unchanged from Opus 4.6: $5 per million input tokens and $25 per million output tokens. However, the updated tokenizer generates between 1.0x and 1.35x more tokens from the same text, so the actual cost per document may be slightly higher.

What is the most significant improvement in Opus 4.7?

It depends on the application area. For coding, the Rakuten benchmark shows 3x the tasks solved in production compared to Opus 4.6. For document vision, visual acuity goes from 54.5% to 98.5%. For the legal sector, BigLaw Bench reaches 90.9% accuracy.

Is Claude Opus 4.7 available on Amazon Bedrock?

Yes. Claude Opus 4.7 is available on Amazon Bedrock, Google Cloud Vertex AI and Microsoft Foundry, as well as on direct Anthropic API and Claude.ai. The API model ID is claude-opus-4-7.

Is Claude Opus 4.7 better than Opus 4.6 in everything?

On benchmarks published by Anthropic and partners, Opus 4.7 outperforms Opus 4.6 in all tested areas: coding, document vision, legal analysis, financial analysis, multi-step workflows. However, the updated tokenizer can increase API costs up to 35% for the same text input.

Does the Opus 4.7 tokenizer always increase costs?

Not always by the same amount. The increase is between 1.0x and 1.35x compared to Opus 4.6, varying based on text type. Short or structured texts tend to increase less; long narrative texts may approach 35% increase. Validation on your own typical documents is necessary before migrating.

Can I use Opus 4.7 with the Claude Team plan?

Yes. Claude Opus 4.7 is available on all Claude.ai plans including Team. For managed plans (Team and Enterprise), pricing doesn't change directly with the updated tokenizer, but included usage limits may vary.

What is the xhigh level in Opus 4.7 effort control?

xhigh is a new effort control level introduced with Opus 4.7, placed between high and max. It allows calibrating the balance between reasoning depth and latency with greater precision than previous levels. It is useful for tasks requiring structured reasoning where max latency is problematic.

What is visual acuity and why is it important for documents?

Visual acuity measures an AI model's ability to correctly interpret visual content in a document: handwritten text, tables, charts, diagrams, chemical structures. It differs from classic OCR because it requires understanding visual context, not just character recognition. XBOW is the specific benchmark that measured Opus 4.7's 98.5% against Opus 4.6's 54.5%.

Can Claude Opus 4.7 reliably read scanned invoices?

With 98.5% visual acuity, Opus 4.7 is significantly more reliable than Opus 4.6 on scanned invoice processing. Actual accuracy depends on scan quality and document layout. For operational use, a hybrid workflow with confidence scores and human review on low-confidence documents is recommended.

What is the maximum image resolution supported by Opus 4.7?

Claude Opus 4.7 supports images up to 2,576 pixels on the long side, equivalent to about 3.75 megapixels. This is more than three times the limit of previous Claude models and allows processing A4 documents scanned at high resolution without prior reduction.

Can Claude Opus 4.7 interpret chemical structures and technical diagrams?

Yes. Anthropic explicitly identifies chemical structures and technical diagrams as primary use cases for Opus 4.7's vision capabilities. The improvement in visual acuity makes Opus 4.7 significantly more reliable than 4.6 on these types of specialized content.

Is Claude Opus 4.7 the best model for coding?

On published benchmarks — CursorBench 70%, Rakuten-SWE-Bench 3x tasks solved, CodeRabbit +10% recall — Opus 4.7 sets new coding AI references. The superiority is most marked on complex tasks requiring contextual understanding of real codebases, less so on simple tasks where Sonnet is comparable.

What does Rakuten-SWE-Bench measure?

Rakuten-SWE-Bench measures the ability to resolve real issues on production GitHub repositories: bug reports and feature requests from real projects, not synthetic problems. Opus 4.7 solves 3x more tasks than Opus 4.6 on this benchmark, indicating real improvement on concrete code problems.

Can I use Claude Opus 4.7 for automated code review?

Yes. CodeRabbit, a platform specialized in AI code review, recorded more than 10% improvement in recall with Opus 4.7 compared to Opus 4.6. This means Opus 4.7 identifies more problems in code during pull requests. Integration can happen directly via Anthropic API or through platforms like CodeRabbit.

Should I use Opus 4.7 for all code or only complex tasks?

Only for complex tasks. Sonnet remains the optimal choice for completing simple functions, documentation, boilerplate and routine coding — with significantly lower cost per token and comparable quality. Opus 4.7 should be reserved for debugging complex codebases, structural refactoring, critical code review and code generation for systems where correctness is essential.

What does the Notion Agent benchmark measure?

The Notion Agent benchmark measures Claude's ability to complete multi-step workflows within the Notion environment: researching and synthesizing from multiple sources, updating documents, creating structures from specifications, managing editorial workflows. Opus 4.7 records a 14% improvement over Opus 4.6.

What is the xhigh effort control in Claude Opus 4.7?

xhigh is a new effort control level introduced with Opus 4.7, placed between high and max. It allows calibrating the balance between reasoning depth and latency with greater precision. It is useful in agentic workflows for complex analysis steps where max is excessive but high is insufficient.

Does Claude Opus 4.7 support multi-day enterprise workflows?

Yes. Anthropic identifies multi-day enterprise workflows on spreadsheets, slides and documents as a primary use case for Opus 4.7. The 1 million token context window ensures the entire workflow history remains accessible to the model during execution.

Which model for AI agents: Opus 4.7 or Sonnet?

It depends on workflow complexity and cost/latency constraints. Opus 4.7 is the choice for complex agentic workflows requiring structured planning, multi-step decisions and coherence over long reasoning chains. Sonnet is more suitable for simpler agents or execution steps within an Opus 4.7 workflow.

What is BigLaw Bench and why is it relevant for the legal sector?

BigLaw Bench is the benchmark developed by Harvey to measure AI model capabilities on complex legal tasks: contract analysis, legal research, drafting legal opinions. It's built on real tasks from top-tier law firms. Claude Opus 4.7 reaches 90.9% accuracy on this benchmark.

Can Claude Opus 4.7 replace a lawyer in contract analysis?

No. The 90.9% accuracy on BigLaw Bench is a relevant result, but the 9.1% error rate implies professional supervision remains necessary for high-impact tasks. Opus 4.7 is an acceleration and coverage tool — analyzes more documents in less time — but professional judgment on critical contracts remains the responsibility of legal professionals.

Can Claude Opus 4.7 analyze historical scanned contracts?

Yes. With 98.5% visual acuity (XBOW benchmark), Opus 4.7 makes automated processing of scanned contract archives viable. The workflow includes high-resolution scanning, processing with Opus 4.7 for extraction and indexing, and quality check for low-confidence documents.

How is Claude Opus 4.7 used for legal due diligence?

The recommended workflow is in three phases: automatic triage of all documents to identify high-risk ones; in-depth analysis of critical documents with Opus 4.7 at xhigh effort; synthesis into a legal risk matrix. This approach increases data room coverage while maintaining analysis quality on critical documents.

What is Databricks OfficeQA Pro and what does it measure for Claude?

Databricks OfficeQA Pro is a benchmark measuring the ability to correctly answer questions about real enterprise documents: reports, spreadsheets, presentations with irregular formatting and cross-references. Opus 4.7 makes 21% fewer errors than Opus 4.6 on this benchmark.

What does the General Finance module score of 0.813 mean concretely?

The General Finance module measures reasoning capability on financial analysis tasks: financial statements, financial ratios, variance analysis. The 0.813 score of Opus 4.7 (against 0.767 for Opus 4.6 on a 0-1 scale) indicates greater accuracy on structured financial calculations and analysis.

Can Claude Opus 4.7 read and analyze financial statements in PDF?

Yes. Opus 4.7 can analyze financial statements in PDF format and produce financial ratios, year-over-year changes and anomaly flags. The 98.5% visual acuity makes extraction from scanned PDFs more reliable than Opus 4.6. Verification on critical numerical calculations remains recommended.

What improvements does Opus 4.7 bring in missing data handling?

Hex reports Opus 4.7 superior performance on missing data handling: the model identifies and flags data gaps, uses context-appropriate imputation methods, and explicitly communicates the uncertainty introduced by missing values — behaviors more correct than Opus 4.6.

Claude Opus 4.7: New Features, Benchmarks and Availability (April 2026)

What is Claude Opus 4.7 and what changes from 4.6

On April 16, 2026, Anthropic released Claude Opus 4.7, the direct successor to Opus 4.6 and the new flagship model in the Claude family. It is a hybrid reasoning model: it combines immediate response capabilities with deep reasoning on demand, optimizing the balance between output quality and latency.

The context window remains at 1 million tokens — one of the highest values among large models available on the market. This allows analyzing entire code repositories, multiple contracts, complex financial documents or meeting transcripts without losing the thread of reasoning.

The most important structural technical change is the updated tokenizer: the same text input generates between 1.0x and 1.35x more tokens compared to Opus 4.6. This has direct implications for API costs and budget planning. We cover this in detail in the Claude Opus 4.7 vs Opus 4.6 article.

Another notable technical innovation is the new effort control level: `xhigh`, placed between `high` and `max`. It allows developers to calibrate the balance between reasoning depth and response latency with greater precision — useful for enterprise workflows where response time is a critical constraint.

Official benchmarks: the numbers that matter

Anthropic published a series of benchmarks with industry partners that show substantial improvements over Opus 4.6.

On coding, CursorBench measures the completion of real programming tasks: Opus 4.7 reaches 70% against Opus 4.6's 58%. Rakuten-SWE-Bench, which measures solving software development tasks in production environments, shows that Opus 4.7 solves 3x more tasks than its predecessor. CodeRabbit, specialized in automated code review, records a more than 10% improvement in code problem recall.

In vision and document analysis, the most relevant result comes from XBOW, measuring visual acuity — the ability to correctly interpret images and visual documents: Opus 4.7 reaches 98.5% against 54.5% for Opus 4.6. A 44-percentage-point jump that opens new scenarios for processing scanned invoices, paper contracts, technical diagrams and chemical structures. Image support reaches up to 2,576 pixels on the long side (about 3.75 megapixels), more than three times the limit of previous Claude models.

For the legal sector, Harvey reports 90.9% accuracy on BigLaw Bench, the reference benchmark for complex legal tasks. In financial analysis, the General Finance module scores 0.813 vs 0.767 for Opus 4.6, with Hex reporting superior performance on missing data handling.

For multi-step workflows, Notion Agent records +14% compared to Opus 4.6, while Databricks OfficeQA Pro shows 21% fewer errors on enterprise document analysis tasks.

Pricing and availability: where to find it and what it costs

Claude Opus 4.7 is available from release day on all major channels: Claude.ai in Pro, Max, Team and Enterprise plans; direct Claude API; Amazon Bedrock; Google Cloud Vertex AI; Microsoft Foundry. The API model ID is `claude-opus-4-7`.

Pricing remains unchanged from Opus 4.6: $5 per million input tokens, $25 per million output tokens. At first glance this seems like good news — but the updated tokenizer (which generates up to 35% more tokens from the same text) means the same document will cost slightly more to process than before. For heavy API users, the impact needs to be calculated based on your actual volume.

For enterprises using Claude through managed plans (Team or Enterprise), pricing doesn't change directly. What changes is the quality of responses obtained, which justifies transitioning to the new model. For those evaluating available plans, the guide on what Claude costs for enterprises is the updated reference.

Availability on Amazon Bedrock, Google Cloud Vertex AI and Microsoft Foundry means enterprises with existing cloud infrastructure can integrate Opus 4.7 without migrating to the direct Anthropic API — important flexibility for IT teams with pre-defined architecture constraints.

Primary use cases according to Anthropic

Anthropic has identified five primary application areas for Opus 4.7, all supported by published benchmarks.

Advanced coding and code review: the combined CursorBench 70%, Rakuten 3x and CodeRabbit +10% defines Opus 4.7 as the reference model for development teams working on complex codebases. Not just code writing, but pull request analysis, bug identification and structural refactoring.

AI agents for multi-tool workflows: the +14% from Notion on multi-step workflows and the new `xhigh` effort control level make Opus 4.7 particularly suited for agents orchestrating multiple tools in sequence. For those building agentic solutions, the Claude Opus 4.7 for AI agents article covers the technical details.

Multi-day enterprise workflows: complex spreadsheets, slide decks, policy documents — tasks requiring coherence over long time horizons and large volumes of context.

Computer vision: chemical structures, technical diagrams, engineering blueprints. The jump in visual acuity (98.5% vs 54.5%) makes Opus 4.7 a practical tool for automated analysis of non-natively digital documents.

Document analysis and reasoning: contracts, financial statements, due diligence reports. The combination of wide context window and improved reasoning makes it suitable for tasks where document understanding must be precise and verifiable.

Want to adopt Claude Opus 4.7 in your enterprise?

30 minutes to discuss your specific case.

Book a call

What it means for enterprises using Claude today

If your company already uses Claude — via API, Claude.ai or a Team/Enterprise plan — Opus 4.7 is a natural upgrade for tasks where quality matters more than cost per token.

The practical advice is to start evaluating Opus 4.7 on your most critical workflows: those where an error is costly, where document complexity is high, or where output feeds important decisions. The benchmarks indicate real and measurable gains — but validation on your specific use case remains necessary.

For API users, the updated tokenizer requires re-examining cost estimates. It's not a marginal change: up to 35% more tokens from the same text impacts budgets at high volumes. The Claude Opus 4.7 vs Opus 4.6 article analyzes this in detail with practical examples.

For development teams, the Rakuten benchmark (3x tasks solved in production) is the most significant data point. It's not a synthetic benchmark: it measures the ability to solve real problems on real codebases. For those using Claude in the software development cycle, testing Opus 4.7 directly is worthwhile. The Claude Opus 4.7 for coding article covers the details.

Maverick AI supports enterprises in adopting and evaluating Claude Opus 4.7, from choosing the right plan to integrating with existing systems. If you want to understand how Opus 4.7 fits into your AI architecture, talk to our team.

Benchmarks compared: what improved and by how much

The direct comparison between Opus 4.7 and Opus 4.6 shows cross-cutting improvements, with some areas recording particularly significant jumps.

On coding, CursorBench measures the ability to complete real programming tasks: Opus 4.7 reaches 70% against Opus 4.6's 58%, a gain of 12 percentage points. The most relevant data comes from Rakuten-SWE-Bench, measuring resolution of tasks in real production environments: Opus 4.7 solves 3x more tasks than its predecessor. CodeRabbit reports more than 10% improvement in code problem recall during code reviews.

In document vision, the jump is the most marked of all: XBOW measures visual acuity — the ability to interpret visual documents, diagrams and images — and Opus 4.7 reaches 98.5% against Opus 4.6's 54.5%. Forty-four percentage points of difference. Image support moves to a maximum of 2,576 pixels on the long side (about 3.75 megapixels), more than three times the limit of previous Claude models.

For data analysis, Databricks OfficeQA Pro records 21% fewer errors on document analysis tasks, while the General Finance module moves from 0.767 to 0.813. Hex reports superior performance on missing data handling in complex analyses.

For the legal sector, Harvey reports 90.9% accuracy on BigLaw Bench. For multi-step workflows, Notion Agent records +14%. These figures are documented by Anthropic and the cited partners.

The updated tokenizer: API cost implications

The technical change with the most immediate implications for enterprise budgets is the updated tokenizer. The same input text generates between 1.0x and 1.35x more tokens compared to Opus 4.6. In practice: a document that with Opus 4.6 consumed 10,000 tokens with Opus 4.7 may consume between 10,000 and 13,500.

Pricing remains unchanged ($5/M input, $25/M output), so the cost increase is proportional to the token increase. For those processing large volumes of documents — contracts, financial reports, codebases — the impact needs to be quantified before migrating to Opus 4.7 for all workflows.

The 1.0x-1.35x variation is not uniform: it depends on the type of text. Texts with repetitive structures, source code or mathematical formulas tend to be less affected; narrative texts in languages with rich morphology may approach the upper limit. The practical suggestion is to test your typical documents before migrating the entire pipeline.

For enterprises using Claude through managed plans (Team or Enterprise on Claude.ai), this aspect doesn't directly impact costs — but it can affect the monthly usage limits included in the plan. For those using the API with fixed budgets, recalibration is necessary. The guide on what Claude costs for enterprises is the updated reference for planning.

When upgrading makes sense: use case analysis

Not all workflows benefit equally from moving to Opus 4.7. A use-case-by-category analysis helps understand where to invest.

Coding and software development: upgrade recommended without reservation. The combined CursorBench +12pp and Rakuten 3x indicates real, measurable improvement. If the team uses Claude to generate code, do code review or solve bugs on complex codebases, Opus 4.7 will produce better output. The Claude Opus 4.7 for coding article covers practical cases.

Visual document analysis: necessary upgrade. The jump from 54.5% to 98.5% on visual acuity completely changes the viability of Opus 4.7 for scanned documents, invoices, paper contracts, technical diagrams. With Opus 4.6 reliability was insufficient for professional use; with 4.7 it becomes a practical tool.

Legal workflows: with 90.9% on BigLaw Bench, Opus 4.7 is significantly more precise on complex legal tasks. For law firms and compliance teams, the upgrade pays for itself quickly.

Data analysis and BI: the 21% fewer errors on OfficeQA and the Finance module improvement make the upgrade useful for financial and business intelligence workflows. The Claude Opus 4.7 for data analysis article details the benchmarks.

Simple or high-volume tasks: here the calculation changes. If your workflows use Opus for tasks that Sonnet could handle with comparable quality, this is the time to reassess model routing — not to automatically migrate to Opus 4.7. The updated tokenizer can erode economic advantages.

Claude Opus 4.7: All the new features of April 16, 2026