The Regularizer
ModelsProvidersBenchmarksBlog
•

The Regularizer

Your comprehensive resource for AI model information, benchmarks, and comparisons.

Quick Links

  • Models
  • Providers
  • Blog

About

The Regularizer is a platform dedicated to tracking and comparing AI models and their capabilities.

© 2025 The Regularizer. All rights reserved.

← Back to Models
Commercial
MIT License for code; DeepSeek License Agreement for models
open-source

DeepSeek-LLM

DeepSeek•Released Nov 1, 2023•Updated May 4, 2025

Description

Open-source models from DeepSeek, with the largest variant (V3) at 671B parameters trained on English and Chinese text:contentReference[oaicite:45]{index=45}:contentReference[oaicite:46]{index=46}.

Technical Specifications

Parameters
671
Context Length
4.1K
Architecture
Pre-norm decoder-only Transformer with RMSNorm, SwiGLU, RoPE, and GQA
Score
33.9

Typical Use Cases

  • Text generation
  • Chatbots
  • Coding assistance
  • Mathematical problem-solving
  • Language translation

Model Information

Type
Commercial
License
MIT License for code; DeepSeek License Agreement for models
Category
open-source
Release Date
Nov 1, 2023
Provider Website
Visit Website