Skip to content
Seedance 2.0 Face is here — generate video from real-person reference photos.Try it now
Back to Models
minimaxminimax·May 31, 2026

MiniMax: MiniMax M3

minimax/minimax-m3
context
1.0M
Max Output
512K
Input / 1M
$0.60
Output / 1M
$2.40

About

MiniMax-M3 is a natively multimodal Mixture-of-Experts foundation model from MiniMax with 229.9B total parameters, activating about 9.8B per token across 256 fine-grained experts. It supports text, image, and video inputs with text output, a 1M-token context window, and up to 512K output tokens, and is suited for long-horizon agentic work, coding, and tool use. It is built on MiniMax Sparse Attention (MSA), which uses a lightweight index branch to select relevant KV-cache blocks so the main attention layer processes only those blocks, cutting per-token compute at 1M context to roughly 1/20 of the previous M2 generation. MSA delivers about 9.7x faster prefill and 15.6x faster decode at 1M tokens versus M2 while retaining accuracy, and the model reaches 59.0% on SWE-bench Pro.

Capabilities

Context Length
1.0M
Max Output
512K
Reasoning
Yes
In
text, image, video
Out
text
44.4#15 of 133
Intelligence Index
43.4#27 of 118
Coding Index

Reasoning & Knowledge

GPQA Diamond92.9%
HLE37.1%

Coding & Agentic

SciCode45.4%
Terminal-Bench Hard42.4%

Source: Artificial Analysis

TypePrice / 1M tokens
Input$0.60
Output$2.40
Cache Read$0.12

OpenAI-compatible · Model ID minimax/minimax-m3

curl https://api.elliotgate.com/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax/minimax-m3",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'