Qwen3 VL 30B A3B Instruct

Qwen · qwen/qwen3-vl-30b-a3b-instruct

← Back to leaderboard

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

open weightsimagetexttext+image->text

Context

Max context: 131072
Max output: 32768

Pricing

Input / 1M: 0.13
Output / 1M: 0.52
Blend / 1M: 0.33

Quality

Quality index:

Provider

Provider: Qwen
Moderated: no