NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA Developer Architecture Profile

Intelligence (ELO): 1449 (Chatbot Arena Verified)
Max Context: 131,072 tokens
API Cost / 1M: $2.40 (blended prompt + completion)

Model Capabilities

  • Classification
  • Conversational
  • Coding & Logic
  • Fictional
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed to generate precise, useful responses. Built on the [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and tuned with Reinforcement Learning from Human Feedback (RLHF), it excels...

Granular Pricing Matrix

Input Tokens (Prompt): $1.20 / 1M
Output Tokens (Completion): $1.20 / 1M

Pricing data via OpenRouter. Sync: 4/30/2026
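
The per-token rates above can be turned into a rough per-request estimate. The following is a minimal sketch, assuming cost scales linearly with token count at the listed rates; the function name `estimate_cost_usd` and the sample token counts are hypothetical and not part of any provider API, and actual billing may differ.

```python
# Rates copied from the pricing matrix above (USD per 1M tokens).
INPUT_USD_PER_M = 1.20
OUTPUT_USD_PER_M = 1.20


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the cost of one request, assuming simple linear pricing.

    Hypothetical helper for illustration; real invoices may include
    rounding, minimums, or provider-specific adjustments.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (prompt_tokens * INPUT_USD_PER_M
             + completion_tokens * OUTPUT_USD_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    # Sample request: 8,000 prompt tokens and 500 completion tokens.
    print(f"${estimate_cost_usd(8_000, 500):.4f}")
```

With one million prompt tokens and one million completion tokens, the estimate comes to $2.40, which matches the blended figure listed at the top of the profile.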