AI Models — CloudAPI

Qwen: Qwen3 Coder 480B A35B

qwen/qwen3-coder

Reasoning

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

Input

3 632/M so'm

$0.29

Output

29 718/M so'm

$2.34

data_array 1,049K context

Qwen: Qwen3 Coder 480B A35B (free)

qwen/qwen3-coder:free

free_breakfast Free Reasoning

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

Input

Free

Output

Free

data_array 1,049K context

Qwen: Qwen3 Coder Flash

qwen/qwen3-coder-flash

Code

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

Input

3 219/M so'm

$0.25

Output

16 097/M so'm

$1.27

data_array 1,000K context

Qwen: Qwen3 Coder Next

qwen/qwen3-coder-next

Code

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...

Input

1 816/M so'm

$0.14

Output

13 208/M so'm

$1.04

data_array 262K context

Qwen: Qwen3 Coder Plus

qwen/qwen3-coder-plus

Code

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

Input

10 732/M so'm

$0.85

Output

53 658/M so'm

$4.23

data_array 1,000K context

Qwen: Qwen3 Max

qwen/qwen3-max

Reasoning

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

Input

12 878/M so'm

$1.01

Output

64 389/M so'm

$5.07

data_array 262K context

Qwen: Qwen3 Max Thinking

qwen/qwen3-max-thinking

Reasoning

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

Input

12 878/M so'm

$1.01

Output

64 389/M so'm

$5.07

data_array 262K context

Qwen: Qwen3 Next 80B A3B Instruct

qwen/qwen3-next-80b-a3b-instruct

Reasoning

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

Input

1 486/M so'm

$0.12

Output

18 161/M so'm

$1.43

data_array 262K context

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen/qwen3-next-80b-a3b-instruct:free

free_breakfast Free Reasoning

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

Input

Free

Output

Free

data_array 262K context

Qwen: Qwen3 Next 80B A3B Thinking

qwen/qwen3-next-80b-a3b-thinking

Reasoning

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

Input

1 610/M so'm

$0.13

Output

12 878/M so'm

$1.01

data_array 262K context

Qwen: Qwen3 VL 235B A22B Instruct

qwen/qwen3-vl-235b-a22b-instruct

Vision

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

Input

3 302/M so'm

$0.26

Output

14 529/M so'm

$1.14

data_array 262K context

Qwen: Qwen3 VL 235B A22B Thinking

qwen/qwen3-vl-235b-a22b-thinking

Reasoning

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

Input

4 293/M so'm

$0.34

Output

42 926/M so'm

$3.38

data_array 131K context

Qwen: Qwen3 VL 30B A3B Instruct

qwen/qwen3-vl-30b-a3b-instruct

Vision

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Input

2 146/M so'm

$0.17

Output

8 585/M so'm

$0.68

data_array 262K context

Qwen: Qwen3 VL 30B A3B Thinking

qwen/qwen3-vl-30b-a3b-thinking

Reasoning

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

Input

2 146/M so'm

$0.17

Output

25 756/M so'm

$2.03

data_array 131K context

Qwen: Qwen3 VL 32B Instruct

qwen/qwen3-vl-32b-instruct

Reasoning

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Input

1 717/M so'm

$0.14

Output

6 868/M so'm

$0.54

data_array 262K context

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct

Reasoning

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

Input

1 321/M so'm

$0.10

Output

8 255/M so'm

$0.65

data_array 256K context

Qwen: Qwen3 VL 8B Thinking

qwen/qwen3-vl-8b-thinking

Reasoning

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

Input

1 932/M so'm

$0.15

Output

22 536/M so'm

$1.77

data_array 256K context

Qwen: Qwen3.5 397B A17B

qwen/qwen3.5-397b-a17b

Vision

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...

Input

6 356/M so'm

$0.50

Output

40 450/M so'm

$3.19

data_array 256K context

Qwen: Qwen3.5 Plus 2026-02-15

qwen/qwen3.5-plus-02-15

Vision

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. In a variety of...

Input

4 293/M so'm

$0.34

Output

25 756/M so'm

$2.03

data_array 1,000K context

Qwen: Qwen3.5 Plus 2026-04-20

qwen/qwen3.5-plus-20260420

Vision

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...

Input

4 953/M so'm

$0.39

Output

29 718/M so'm

$2.34

data_array 1,000K context

Qwen: Qwen3.5-122B-A10B

qwen/qwen3.5-122b-a10b

Vision

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...

Input

4 293/M so'm

$0.34

Output

34 341/M so'm

$2.70

data_array 262K context

Qwen: Qwen3.5-27B

qwen/qwen3.5-27b

Vision

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...

Input

3 219/M so'm

$0.25

Output

25 756/M so'm

$2.03

data_array 262K context

Qwen: Qwen3.5-35B-A3B

qwen/qwen3.5-35b-a3b

Vision

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...

Input

2 311/M so'm

$0.18

Output

16 510/M so'm

$1.30

data_array 262K context

Qwen: Qwen3.5-9B

qwen/qwen3.5-9b

Reasoning

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...

Input

1 651/M so'm

$0.13

Output

2 477/M so'm

$0.20

data_array 262K context