Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
Vision
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Kirish
13 208/M so'm
$1.04
Chiqish
16 510/M so'm
$1.30
data_array
131K context
Qwen: Qwen3.5 Plus 2026-04-20
qwen/qwen3.5-plus-20260420
Vision
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...
Kirish
4 953/M so'm
$0.39
Chiqish
29 718/M so'm
$2.34
data_array
1,000K context
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
Vision
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...
Kirish
3 096/M so'm
$0.24
Chiqish
18 574/M so'm
$1.46
data_array
1,000K context
xAI: Grok 4.20 Multi-Agent
x-ai/grok-4.20-multi-agent
Vision
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...
Kirish
20 638/M so'm
$1.63
Chiqish
41 275/M so'm
$3.25
data_array
2,000K context