Qwen3 VL 30B A3B Instruct

Qwen · qwen/qwen3-vl-30b-a3b-instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

open weightsimagetexttext+image->text

Context

Max context: 262144
Max output: 16384

Pricing

Input / 1M: 0.15
Output / 1M: 0.60
Blend / 1M: 0.38

Quality

Quality index: —

Provider

Provider: Qwen
Moderated: no