Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
Vision
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Input
13 208/M so'm
$1.04
Output
16 510/M so'm
$1.30
data_array
131K context
Qwen: Qwen3.5 Plus 2026-04-20
qwen/qwen3.5-plus-20260420
Vision
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...
Output
29 718/M so'm
$2.34
data_array
1,000K context
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
Vision
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...
Output
18 574/M so'm
$1.46
data_array
1,000K context