DeepSeek-LLM

General-purpose foundation model; 67B parameters; outperforms LLaMA-2-70B on reasoning and math

Launched in November 2023 shortly after DeepSeek-Coder, DeepSeek-LLM introduced the company's first general-purpose large language model with 67 billion parameters trained on 2 trillion tokens across English and Chinese. Despite being smaller than contemporary competitors, it demonstrated superior performance on reasoning, coding, mathematics, and Chinese comprehension benchmarks compared to LLaMA-2-70B. The model was released as open-source in base and chat variants, reinforcing DeepSeek's commitment to democratized AI development. Its strong performance-to-parameter ratio made it a notable achievement for 2023 and established DeepSeek as a serious contender in the open-source LLM space.

Reviews

No Reviews Yet

Be the first to share your experience with this AI tool

DeepSeek-LLM

Reviews

No Reviews Yet

More models from Deepseek

DeepSeek-V3.1-Terminus

DeepSeek-V3.2-Exp

DeepSeek-V3.2

DeepSeek-V3.1

DeepSeek-V3.2-Speciale

DeepSeek-Math

DeepSeek-V2

DeepSeek-Coder-V2

DeepSeek-V2.5

DeepSeek-Coder

DeepSeek-MoE

DeepSeek-V3