Skip to content

MiniMax M3

MiniMax M3 is MiniMax's first model with a 1M tokens context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, with a max output of 1M tokens per request.

ReasoningTool UseVision (Image)File InputImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m3',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
MiniMax
1M
3.7s
40tps
$0.60/M$0.30/M
$2.40/M$1.20/M
Read:
$0.12/M$0.06/M
Write:
+3
05/31/2026
Fireworks
512K
1.2s
160tps
$0.30/M$1.20/M
Read:$0.06/M
Write:
+3
05/31/2026