Qwen: Qwen3.5-Flash
QWEN Developer Architecture Profile
Intelligence (ELO): 1150 (Chatbot Arena Verified)
Max Context: 1,000,000 tokens
API Cost / 1M: $0.50 (Blended Prompt + Completion)
Model Capabilities
- Drafting
- Classification
- Vision
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design for higher inference efficiency. Compared with the Qwen3 series, these models perform markedly better on both pure-text and multimodal tasks while keeping response times fast.
Granular Pricing Matrix
Input Tokens (Prompt): $0.10 / 1M
Output Tokens (Completion): $0.40 / 1M
Pricing data via OpenRouter. Sync: 3/16/2026
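To turn the per-million rates above into a per-request estimate, a minimal sketch follows. It assumes prompt and completion tokens are billed linearly at the listed rates ($0.10 and $0.40 per 1M), with no caching discounts, minimum charges, or tiering. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for illustration only.

```python
from decimal import Decimal

# Rates copied from the pricing matrix above (USD per 1M tokens).
INPUT_USD_PER_MILLION = Decimal("0.10")
OUTPUT_USD_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming simple linear billing."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(prompt_tokens) * INPUT_USD_PER_MILLION
        + Decimal(completion_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 1M prompt tokens plus 1M completion tokens reproduces the blended figure.
    print(estimate_cost_usd(1_000_000, 1_000_000))
    # A hypothetical request: 20,000 prompt tokens and 1,000 completion tokens.
    print(estimate_cost_usd(20_000, 1_000))
```

Under these assumptions, the $0.50 blended figure is the sum of the two rates, that is, the cost of 1M prompt tokens plus 1M completion tokens. Real workloads are usually prompt-heavy, so the effective cost per 1M total tokens will often sit closer to the input rate.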
Evaluate Competitors
- Qwen: Qwen3.5-Flash vs ByteDance Seed: Seed-2.0-Lite
- Qwen: Qwen3.5-Flash vs Qwen: Qwen3.5-35B-A3B
- Qwen: Qwen3.5-Flash vs MiniMax: MiniMax M2.5 (free)
- Qwen: Qwen3.5-Flash vs MiniMax: MiniMax M2.5
- Qwen: Qwen3.5-Flash vs StepFun: Step 3.5 Flash (free)
- Qwen: Qwen3.5-Flash vs StepFun: Step 3.5 Flash