Claude 3.5 Sonnet (New)

Major October 2024 upgrade; introduced computer use capability; enhanced coding and agentic task performance.

Claude 3.5 Sonnet (New), released October 22, 2024, brought substantial improvements over the June version. Most notably, it introduced computer use capability in public beta, allowing Claude to interact with desktop environments by moving cursors, clicking buttons, and typing text to perform multi-step tasks across applications. Performance improved significantly on coding benchmarks and agentic reasoning tasks. The model showed marked gains in handling complex instructions, humor, nuance, and sustained multi-application workflows. METR evaluations showed its autonomous capabilities matched what human baseliners achieve in approximately one hour. This update positioned Claude as a pioneer in practical AI agents capable of operating software interfaces. The computer use feature opened new possibilities for automation, testing, and accessibility applications. Deprecated and retired October 2025.

More models from Claude

Most intelligent Claude model; 80.9% SWE-bench; effort parameter for compute control; Infinite Chats feature.

Fastest and most compact Claude 3 model; optimized for near-instant responses; excels at high-volume cost-sensitive tasks.

Balanced mid-tier model with 2x speed improvement over Claude 2; strong enterprise performance at moderate cost.

Flagship Claude 3 model; 200K context (expandable to 1M); outperformed GPT-4 on most benchmarks at launch.

Outperformed Claude 3 Opus at half the cost; introduced Artifacts feature; benchmark leader in coding and reasoning.

Fast efficient model matching Claude 3 Opus performance; cost-effective option for scaled deployments.

First hybrid reasoning model; toggleable extended thinking mode; 70.3% SWE-bench score; 128K output tokens.

Claude 4 family workhorse; superior coding and reasoning over 3.7; hybrid reasoning with tool use support.

Most powerful Claude 4 model; ASL-3 safety classification; 72.5% SWE-bench; excels at sustained 7-hour coding sessions.

Incremental Opus 4 upgrade; 74.5% SWE-bench score; enhanced agentic reasoning and multi-file refactoring.

Best coding model in the world; 77.2% SWE-bench; 61.4% OSWorld for computer use; 30+ hour sustained focus.

Near-frontier performance at lowest cost; matches Sonnet 4 on coding; first Haiku with extended thinking.