Qwen3 VL 235B A22B Instruct

Qwen · qwen/qwen3-vl-235b-a22b-instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

open weights · image · text · text+image->text

Context

Max context: 262144 tokens
Max output: 16384 tokens
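
As a rough illustration of how the two limits interact, the sketch below computes how many output tokens remain for a given prompt size. It assumes the prompt and the generated output share the same context window, which the listing does not state; `output_budget` is a hypothetical helper, not part of any Qwen or provider API.

```python
# Limits copied from the Context section above.
MAX_CONTEXT = 262144  # 2**18 tokens
MAX_OUTPUT = 16384    # 2**14 tokens


def output_budget(prompt_tokens: int) -> int:
    """Return the output tokens available for a prompt of the given size.

    Assumption (not confirmed by the listing): prompt and output tokens
    are counted against the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = MAX_CONTEXT - prompt_tokens
    # Never exceed the per-response cap, never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))
```

Under that assumption, a prompt of up to 245760 tokens still leaves room for the full 16384-token output.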

Pricing

Input / 1M tokens: 0.20
Output / 1M tokens: 0.88
Blend / 1M tokens: 0.54
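
The listed blend matches the unweighted mean of the input and output prices, (0.20 + 0.88) / 2 = 0.54. The sketch below checks that arithmetic and estimates the cost of a single request. The 1:1 blend formula is inferred from the numbers rather than stated by the listing, the currency is not given in the source, and `blended_price` and `request_cost` are hypothetical helper names.

```python
import math

# Prices copied from the Pricing section (per 1M tokens; currency not stated).
INPUT_PER_1M = 0.20
OUTPUT_PER_1M = 0.88
BLEND_PER_1M = 0.54


def blended_price(input_price: float, output_price: float) -> float:
    """Unweighted mean of input and output price (assumed blend formula)."""
    return (input_price + output_price) / 2


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the listed per-1M-token rates."""
    return (input_tokens * INPUT_PER_1M + output_tokens * OUTPUT_PER_1M) / 1_000_000


# The listed blend is consistent with a 1:1 input/output mix.
assert math.isclose(blended_price(INPUT_PER_1M, OUTPUT_PER_1M), BLEND_PER_1M)
```

Real workloads rarely have a 1:1 token mix, so `request_cost` with actual token counts gives a better estimate than the blend figure.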

Quality

Quality index: 68.6

Provider

Provider: Qwen
Moderated: no