Skip to content

Nvidia Nemotron Nano 9B V2

Nvidia Nemotron Nano 9B V2 is a dense hybrid Mamba-Transformer reasoning model that matches or exceeds Qwen3-8B accuracy at up to 6x the throughput, with built-in thinking budget control.

ReasoningTool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'nvidia/nemotron-nano-9b-v2',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Amazon Bedrock
131K
0.2s
150tps
$0.06/M$0.23/M
08/18/2025
DeepInfra
131K
0.2s
101tps
$0.04/M$0.16/M
08/18/2025