AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder
Reasoning
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...
Input
3 632/M so'm
$0.29
Output
29 718/M so'm
$2.34
data_array 1,049K context
Qwen: Qwen3 Coder 480B A35B (free)
qwen/qwen3-coder:free
free_breakfast Free Reasoning
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...
Input
Free
Output
Free
data_array 1,049K context
Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flash
Code
Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...
Input
3 219/M so'm
$0.25
Output
16 097/M so'm
$1.27
data_array 1,000K context
Qwen: Qwen3 Coder Next
qwen/qwen3-coder-next
Code
Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...
Input
1 816/M so'm
$0.14
Output
13 208/M so'm
$1.04
data_array 262K context
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
Code
Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...
Input
10 732/M so'm
$0.85
Output
53 658/M so'm
$4.23
data_array 1,000K context
Qwen: Qwen3 Max
qwen/qwen3-max
Reasoning
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...
Input
12 878/M so'm
$1.01
Output
64 389/M so'm
$5.07
data_array 262K context
Qwen: Qwen3 Max Thinking
qwen/qwen3-max-thinking
Reasoning
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
Input
12 878/M so'm
$1.01
Output
64 389/M so'm
$5.07
data_array 262K context
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
Reasoning
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Input
1 486/M so'm
$0.12
Output
18 161/M so'm
$1.43
data_array 262K context
Qwen: Qwen3 Next 80B A3B Instruct (free)
qwen/qwen3-next-80b-a3b-instruct:free
free_breakfast Free Reasoning
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Input
Free
Output
Free
data_array 262K context
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking
Reasoning
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...
Input
1 610/M so'm
$0.13
Output
12 878/M so'm
$1.01
data_array 262K context
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
Vision
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...
Input
3 302/M so'm
$0.26
Output
14 529/M so'm
$1.14
data_array 262K context
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking
Reasoning
Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....
Input
4 293/M so'm
$0.34
Output
42 926/M so'm
$3.38
data_array 131K context
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
Vision
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
Input
2 146/M so'm
$0.17
Output
8 585/M so'm
$0.68
data_array 262K context
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking
Reasoning
Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...
Input
2 146/M so'm
$0.17
Output
25 756/M so'm
$2.03
data_array 131K context
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
Reasoning
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Input
1 717/M so'm
$0.14
Output
6 868/M so'm
$0.54
data_array 262K context
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct
Reasoning
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...
Input
1 321/M so'm
$0.10
Output
8 255/M so'm
$0.65
data_array 256K context
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
Reasoning
Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...
Input
1 932/M so'm
$0.15
Output
22 536/M so'm
$1.77
data_array 256K context
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
Vision
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...
Input
6 356/M so'm
$0.50
Output
40 450/M so'm
$3.19
data_array 256K context
Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15
Vision
The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. In a variety of...
Input
4 293/M so'm
$0.34
Output
25 756/M so'm
$2.03
data_array 1,000K context
Qwen: Qwen3.5 Plus 2026-04-20
qwen/qwen3.5-plus-20260420
Vision
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...
Input
4 953/M so'm
$0.39
Output
29 718/M so'm
$2.34
data_array 1,000K context
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
Vision
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...
Input
4 293/M so'm
$0.34
Output
34 341/M so'm
$2.70
data_array 262K context
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
Vision
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
Input
3 219/M so'm
$0.25
Output
25 756/M so'm
$2.03
data_array 262K context
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
Vision
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...
Input
2 311/M so'm
$0.18
Output
16 510/M so'm
$1.30
data_array 262K context
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
Reasoning
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...
Input
1 651/M so'm
$0.13
Output
2 477/M so'm
$0.20
data_array 262K context