MiniMax M3

MiniMax M3 is MiniMax's first model with a 1M-token context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, and supports up to 1M output tokens per request.

Reasoning, Tool Use, Vision (Image), File Input, Implicit Caching

index.ts

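import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m3',
  prompt: 'Why is the sky blue?'
})

// Consume the stream; without this loop nothing is printed.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}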


Providers

Route requests across multiple providers; use a provider's slug to set your preference. Using a provider means you agree to that provider's terms.
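As a rough illustration of setting a provider preference, the sketch below builds a request object that lists providers in preferred order. The `providerOptions.gateway.order` shape and the slugs `'fireworks'` and `'minimax'` are assumptions, not taken from this page; check them against the gateway documentation and copy the real slugs from the provider list. `buildRequest` is a hypothetical helper introduced only for this example.

```typescript
// Hypothetical request shape: only model and prompt appear on this page.
// The providerOptions.gateway.order field is an assumption to verify.
interface GatewayRequest {
  model: string
  prompt: string
  providerOptions: { gateway: { order: string[] } }
}

// Hypothetical helper: returns options with providers in preferred order.
function buildRequest(prompt: string, providerOrder: string[]): GatewayRequest {
  return {
    model: 'minimax/minimax-m3',
    prompt,
    // Copy the array so later changes by the caller do not leak in.
    providerOptions: { gateway: { order: [...providerOrder] } }
  }
}

// Assumed slugs; replace with the slugs copied from the provider list.
const request = buildRequest('Why is the sky blue?', ['fireworks', 'minimax'])
console.log(JSON.stringify(request.providerOptions))
```

If the assumed option shape holds, the resulting object could be passed directly as `streamText(request)`.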

Provider	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Release Date
MiniMax		3.7s	40tps	$0.60/M, $0.30/M	$2.40/M, $1.20/M	$0.12/M, $0.06/M	—	05/31/2026
Fireworks	512K	1.2s	160tps	$0.30/M	$1.20/M	$0.06/M	—	05/31/2026
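To make the listed figures easier to compare, here is a minimal sketch that estimates the cost and wall-clock time of a single request from the Fireworks numbers above ($0.30/M input, $1.20/M output, $0.06/M cache read, 1.2s latency, 160tps). It assumes that cached input tokens are billed at the cache-read rate instead of the input rate, that latency means time to first token, and that throughput is sustained output tokens per second. None of these billing or measurement details are stated on this page, and all names in the code are hypothetical.

```typescript
// Per-provider figures as listed on this page (prices in USD per 1M tokens).
interface ProviderStats {
  inputPerM: number
  outputPerM: number
  cacheReadPerM: number
  latencySeconds: number
  tokensPerSecond: number
}

interface Usage {
  uncachedInputTokens: number
  cachedInputTokens: number
  outputTokens: number
}

const FIREWORKS: ProviderStats = {
  inputPerM: 0.30,
  outputPerM: 1.20,
  cacheReadPerM: 0.06,
  latencySeconds: 1.2,
  tokensPerSecond: 160
}

// Assumption: cached tokens are charged at the cache-read rate only.
function estimateCostUSD(usage: Usage, p: ProviderStats): number {
  const perToken = (pricePerM: number) => pricePerM / 1_000_000
  return (
    usage.uncachedInputTokens * perToken(p.inputPerM) +
    usage.cachedInputTokens * perToken(p.cacheReadPerM) +
    usage.outputTokens * perToken(p.outputPerM)
  )
}

// Assumption: total time = time to first token + output tokens / throughput.
function estimateSeconds(outputTokens: number, p: ProviderStats): number {
  return p.latencySeconds + outputTokens / p.tokensPerSecond
}

const usage: Usage = {
  uncachedInputTokens: 50_000,
  cachedInputTokens: 150_000,
  outputTokens: 4_000
}

console.log(`cost: $${estimateCostUSD(usage, FIREWORKS).toFixed(4)}`) // cost: $0.0288
console.log(`time: ${estimateSeconds(usage.outputTokens, FIREWORKS).toFixed(1)}s`) // time: 26.2s
```

Under these assumptions, the same 4,000-token response at the MiniMax row's 3.7s and 40tps would take about 103.7s, so throughput dominates latency for long outputs.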
