Skip to content

MiniMax M2

MiniMax M2 is an open-weight MoE model from MiniMax with 230B total parameters and 10B active per forward pass, built for coding and agentic tasks. It supports a context window of 205K tokens and a max output of 205K tokens per request.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m2',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
MiniMax
205K
0.7s
88tps
$0.30/M$1.20/M
Read:$0.03/M
Write:$0.38/M
+1
10/27/2025
Novita AI
205K
0.9s
91tps
$0.30/M$1.20/M
Read:$0.03/M
Write:
10/27/2025