Xiaomi: MiMo-V2-Omni
XIAOMI Developer Architecture Profile
Intelligence (ELO)1425Chatbot Arena Verified
Max Context262,144Tokens
API Cost / 1M$2.40Blended Prompt + Completion
Model Capabilities
- Audio Gen
- Coding & Logic
- Fictional
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
Granular Pricing Matrix
Input Tokens (Prompt)$0.40 / 1M
Output Tokens (Completion)$2.00 / 1M
Pricing data via OpenRouter. Sync: 5/3/2026
Evaluate Competitors
VS Engine MatchupXiaomi: MiMo-V2-Omni vs AllenAI: Olmo 3.1 32B InstructVS Engine MatchupXiaomi: MiMo-V2-Omni vs Mistral: Ministral 3 3B 2512VS Engine MatchupXiaomi: MiMo-V2-Omni vs Qwen: Qwen3 VL 8B ThinkingVS Engine MatchupXiaomi: MiMo-V2-Omni vs OpenAI: o4 Mini Deep ResearchVS Engine MatchupXiaomi: MiMo-V2-Omni vs DeepSeek: DeepSeek V3.2 ExpVS Engine MatchupXiaomi: MiMo-V2-Omni vs Mistral: Codestral 2508