Back to Directory

Xiaomi: MiMo-V2-Omni

XIAOMI Developer Architecture Profile

Intelligence (ELO)1191Chatbot Arena Verified
Max Context262,144Tokens
API Cost / 1M$2.40Blended Prompt + Completion

Model Capabilities

    MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex real-world tasks that span modalities. 256K context window.

    Granular Pricing Matrix

    Input Tokens (Prompt)$0.40 / 1M
    Output Tokens (Completion)$2.00 / 1M

    Pricing data via OpenRouter. Sync: 3/19/2026

    Evaluate Competitors