MiniMax M2

MiniMax M2 is an open-weight MoE model from MiniMax with 230B total parameters and 10B active per forward pass, built for coding and agentic tasks. It supports a context window of 205K tokens and a max output of 205K tokens per request.

ReasoningTool UseImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2',
  prompt: 'Why is the sky blue?'
})

Overview Playground About Providers Throughput Latency Uptime Status Similar FAQ

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

MiniMax

205K

0.7s

88tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:$0.38/M

—

10/27/2025

Novita AI

205K

0.9s

91tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:—

—

10/27/2025

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

MiniMax M2

Providers