Xiaomi: MiMo-V2-Omni
XIAOMI Developer Architecture Profile
Intelligence (ELO)1191Chatbot Arena Verified
Max Context262,144Tokens
API Cost / 1M$2.40Blended Prompt + Completion
Model Capabilities
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex real-world tasks that span modalities. 256K context window.
Granular Pricing Matrix
Input Tokens (Prompt)$0.40 / 1M
Output Tokens (Completion)$2.00 / 1M
Pricing data via OpenRouter. Sync: 3/19/2026
Evaluate Competitors
VS Engine MatchupXiaomi: MiMo-V2-Omni vs MiniMax: MiniMax M2.7VS Engine MatchupXiaomi: MiMo-V2-Omni vs Mistral: Mistral Small 4VS Engine MatchupXiaomi: MiMo-V2-Omni vs Z.ai: GLM 5 TurboVS Engine MatchupXiaomi: MiMo-V2-Omni vs NVIDIA: Nemotron 3 SuperVS Engine MatchupXiaomi: MiMo-V2-Omni vs ByteDance Seed: Seed-2.0-LiteVS Engine MatchupXiaomi: MiMo-V2-Omni vs Inception: Mercury 2