Llama 3.3 70B Instruct
META-LLAMA Developer Architecture Profile
Intelligence (ELO)1250Chatbot Arena Verified
Max Context131,072Tokens
API Cost / 1M$0.42Blended Prompt + Completion
Model Capabilities
- Classification
- Conversational
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperforms many of the available open source and closed chat models on common industry benchmarks.
Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
[Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)
Granular Pricing Matrix
Input Tokens (Prompt)$0.10 / 1M
Output Tokens (Completion)$0.32 / 1M
Pricing data via OpenRouter. Sync: 3/16/2026
Evaluate Competitors
VS Engine MatchupLlama 3.3 70B Instruct vs Anthropic: Claude Opus 4VS Engine MatchupLlama 3.3 70B Instruct vs OpenAI: GPT-4 TurboVS Engine MatchupLlama 3.3 70B Instruct vs OpenAI: GPT-4 Turbo (older v1106)VS Engine MatchupLlama 3.3 70B Instruct vs Qwen: Qwen2.5 VL 72B InstructVS Engine MatchupLlama 3.3 70B Instruct vs DeepSeek: DeepSeek V3.2 SpecialeVS Engine MatchupLlama 3.3 70B Instruct vs DeepSeek: DeepSeek V3.2