Back to Directory

NVIDIA: Nemotron Nano 9B V2

NVIDIA Developer Architecture Profile

Intelligence (ELO)1050Chatbot Arena Verified
Max Context131,072Tokens
API Cost / 1M$0.20Blended Prompt + Completion

Model Capabilities

    NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

    Granular Pricing Matrix

    Input Tokens (Prompt)$0.04 / 1M
    Output Tokens (Completion)$0.16 / 1M

    Pricing data via OpenRouter. Sync: 3/16/2026

    Evaluate Competitors