The Regularizer
ModelsProvidersBenchmarksBlog
•

The Regularizer

Your comprehensive resource for AI model information, benchmarks, and comparisons.

Quick Links

  • Models
  • Providers
  • Blog

About

The Regularizer is a platform dedicated to tracking and comparing AI models and their capabilities.

© 2025 The Regularizer. All rights reserved.

← Back to Models
Commercial
Model Agreement
open-source

Minimax-Text-01

MiniMax•Released Jan 14, 2025•Updated May 4, 2025

Description

A large-scale 456B-parameter Chinese model (2025) released by MiniMax, representing one of the largest dense models to date:contentReference[oaicite:50]{index=50}.

Technical Specifications

Parameters
456
Context Length
4000K
Architecture
{"total_parameters": 456000000000, "activated_parameters_per_token": 45900000000, "number_of_layers": 80, "attention_mechanism": {"type": "hybrid", "components": ["Lightning Attention", "Softmax Attention"], "configuration": {"lightning_attention_layers": 7, "softmax_attention_layers": 1, "attention_heads": 64, "attention_head_dimension": 128}}, "mixture_of_experts": {"number_of_experts": 32, "expert_hidden_dimension": 9216, "routing_strategy": "Top-2"}, "positional_encoding": {"type": "Rotary Position Embedding (RoPE)", "base_frequency": 10000000}, "hidden_size": 6144, "vocabulary_size": 200064}
Score
45.0

Typical Use Cases

  • long-context processing
  • natural language understanding
  • text generation
  • reasoning tasks

Model Information

Type
Commercial
License
Model Agreement
Category
open-source
Release Date
Jan 14, 2025
Provider Website
Visit Website