
DeepSeek-Math
Specialized mathematics model; three variants (Base, Instruct, RL); optimized for STEM problem-solving
Reviews
No Reviews Yet
Be the first to share your experience with this AI tool
More models from Deepseek
V3.1 refinement (September 2025); improved language consistency; enhanced agent performance; more stable outputs
Experimental model (September 29, 2025); DeepSeek Sparse Attention (DSA) mechanism; 50-75% lower inference costs; long-context optimization
Production release (December 1, 2025); 671B parameters; DeepSeek Sparse Attention; GPT-5-level reasoning; 128K context
Hybrid model (August 2025); 671B parameters; dual-mode (thinking + non-thinking); 128K context; enhanced tool calling
Extended reasoning variant (December 1, 2025); extreme thinking mode; 96% AIME score; gold IMO 2025; outperforms GPT-5-High
Second generation MoE model; 236B total parameters (21B active); 128K context window; 50% lower training cost
Advanced coding model; 236B parameters (21B active); 128K context; 338 programming languages; GPT-4-Turbo-level coding
Refinement of V2; improved training data; enhanced transformer architecture; increased computational power
First commercial-grade coding model; 1.3B-33B parameters; supports 80+ programming languages
General-purpose foundation model; 67B parameters; outperforms LLaMA-2-70B on reasoning and math
Mixture of Experts model; efficient inference; reduced memory and computational requirements
Third generation foundation; 671B parameters (37B active); 128K context; major leap in capability and reasoning