The model is an auto-regressive vision language model that uses an optimized transformer architecture. The model enables multi-image reasoning and video understanding, along with strong document intelligence, visual Q&A and summarization capabilities.
import { streamText } from 'ai'
const result = streamText({ model: 'nvidia/nemotron-nano-12b-v2-vl', prompt: 'Why is the sky blue?'})Try out Nvidia Nemotron Nano 12B V2 VL by Nvidia. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.
Chat with
Powered by AI Gateway