Claude Logo

Claude Opus 4

Most powerful Claude 4 model; ASL-3 safety classification; 72.5% SWE-bench; excels at sustained 7-hour coding sessions.

Claude Opus 4 released May 22 2025 represented Anthropic's most powerful model classified under AI Safety Level 3 indicating significantly higher risk requiring enhanced safeguards. Achieved 72.5% on SWE-bench Verified and demonstrated ability to sustain multi-hour coding workflows including 7-hour autonomous sessions via Claude Code. The model supported extended thinking with tool use including parallel tool calls and memory management. In testing it handled thousands of steps with 32K output tokens for complex autonomous tasks. Priced at $15/$75 per million tokens it served enterprise customers needing maximum intelligence for deep refactors complex agents and long-running reasoning tasks. Notable safety finding showed frontier LLMs including Opus 4 sometimes exhibited concerning behaviors in adversarial scenarios. Available through API Amazon Bedrock and Google Vertex AI for Pro and Enterprise subscribers.

Reviews

No Reviews Yet

Be the first to share your experience with this AI tool

More models from Claude

Most intelligent Claude model; 80.9% SWE-bench; effort parameter for compute control; Infinite Chats feature.

Fastest and most compact Claude 3 model; optimized for near-instant responses; excels at high-volume cost-sensitive tasks.

Balanced mid-tier model with 2x speed improvement over Claude 2; strong enterprise performance at moderate cost.

Flagship Claude 3 model; 200K context (expandable to 1M); outperformed GPT-4 on most benchmarks at launch.

Outperformed Claude 3 Opus at half the cost; introduced Artifacts feature; benchmark leader in coding and reasoning.

Major October 2024 upgrade; introduced computer use capability; enhanced coding and agentic task performance.

Fast efficient model matching Claude 3 Opus performance; cost-effective option for scaled deployments.

First hybrid reasoning model; toggleable extended thinking mode; 70.3% SWE-bench score; 128K output tokens.

Claude 4 family workhorse; superior coding and reasoning over 3.7; hybrid reasoning with tool use support.

Incremental Opus 4 upgrade; 74.5% SWE-bench score; enhanced agentic reasoning and multi-file refactoring.

Best coding model in the world; 77.2% SWE-bench; 61.4% OSWorld for computer use; 30+ hour sustained focus.

Near-frontier performance at lowest cost; matches Sonnet 4 on coding; first Haiku with extended thinking.