Llama 3.2 3B Instruct
META-LLAMA Developer Architecture Profile
Intelligence (ELO)1100Chatbot Arena Verified
Max Context80,000Tokens
API Cost / 1M$0.39Blended Prompt + Completion
Model Capabilities
- Classification
- Conversational
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it supports eight languages, including English, Spanish, and Hindi, and is adaptable for additional languages.
Trained on 9 trillion tokens, the Llama 3.2 3B model excels in instruction-following, complex reasoning, and tool use. Its balanced performance makes it ideal for applications needing accuracy and efficiency in text generation across multilingual settings.
Click here for the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md).
Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).
Granular Pricing Matrix
Input Tokens (Prompt)$0.05 / 1M
Output Tokens (Completion)$0.34 / 1M
Pricing data via OpenRouter. Sync: 3/16/2026
Evaluate Competitors
VS Engine MatchupLlama 3.2 3B Instruct vs Mistral: Mixtral 8x7B InstructVS Engine MatchupLlama 3.2 3B Instruct vs Z.ai: GLM 5 TurboVS Engine MatchupLlama 3.2 3B Instruct vs Inception: Mercury 2VS Engine MatchupLlama 3.2 3B Instruct vs Qwen: Qwen3.5-27BVS Engine MatchupLlama 3.2 3B Instruct vs Qwen: Qwen3.5-122B-A10BVS Engine MatchupLlama 3.2 3B Instruct vs AionLabs: Aion-2.0