NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA Developer Architecture Profile

Intelligence (ELO)1449Chatbot Arena Verified

Max Context131,072Tokens

API Cost / 1M$2.40Blended Prompt + Completion

Model Capabilities

Classification
Conversational
Coding & Logic
Fictional

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

Granular Pricing Matrix

Input Tokens (Prompt)$1.20 / 1M

Output Tokens (Completion)$1.20 / 1M

Pricing data via OpenRouter. Sync: 4/30/2026

Evaluate Competitors

VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs Nous: Hermes 4 70B VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs Meta: Llama 3.1 70B Instruct VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs NousResearch: Hermes 2 Pro - Llama-3 8B VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs Anthropic Claude Sonnet Latest VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs Qwen: Qwen3.5-9B VS Engine MatchupNVIDIA: Llama 3.1 Nemotron 70B Instruct vs Qwen: Qwen3.5 397B A17B