Claude Logo

Claude 3.7 Sonnet

First hybrid reasoning model; toggleable extended thinking mode; 70.3% SWE-bench score; 128K output tokens.

Claude 3.7 Sonnet released February 24 2025 pioneered hybrid reasoning architecture allowing users to choose between rapid responses and deep step-by-step thinking within a single model. In standard mode it served as an upgraded Claude 3.5 Sonnet. Extended thinking mode enabled self-reflection before answering improving performance on math physics coding and instruction-following. Developers could set thinking budgets up to 128K tokens balancing cost speed and quality. Achieved 70.3% on SWE-bench Verified outperforming OpenAI o1 and o3-mini. Released alongside Claude Code the agentic command-line tool for delegating coding tasks. Reduced unnecessary refusals by 45% compared to predecessors while maintaining safety. The model demonstrated 84.8% on GPQA Diamond and 80% on AIME 2024 math problems. This release established the hybrid reasoning paradigm that defined Anthropic's subsequent model strategy.

Reviews

No Reviews Yet

Be the first to share your experience with this AI tool

More models from Claude

Most intelligent Claude model; 80.9% SWE-bench; effort parameter for compute control; Infinite Chats feature.

Fastest and most compact Claude 3 model; optimized for near-instant responses; excels at high-volume cost-sensitive tasks.

Balanced mid-tier model with 2x speed improvement over Claude 2; strong enterprise performance at moderate cost.

Flagship Claude 3 model; 200K context (expandable to 1M); outperformed GPT-4 on most benchmarks at launch.

Outperformed Claude 3 Opus at half the cost; introduced Artifacts feature; benchmark leader in coding and reasoning.

Major October 2024 upgrade; introduced computer use capability; enhanced coding and agentic task performance.

Fast efficient model matching Claude 3 Opus performance; cost-effective option for scaled deployments.

Claude 4 family workhorse; superior coding and reasoning over 3.7; hybrid reasoning with tool use support.

Most powerful Claude 4 model; ASL-3 safety classification; 72.5% SWE-bench; excels at sustained 7-hour coding sessions.

Incremental Opus 4 upgrade; 74.5% SWE-bench score; enhanced agentic reasoning and multi-file refactoring.

Best coding model in the world; 77.2% SWE-bench; 61.4% OSWorld for computer use; 30+ hour sustained focus.

Near-frontier performance at lowest cost; matches Sonnet 4 on coding; first Haiku with extended thinking.