AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo
star Featured Code
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
Input
8 255/M so'm
$0.65
Output
24 765/M so'm
$1.95
data_array 16K context
OpenAI: GPT-3.5 Turbo (older v0613)
openai/gpt-3.5-turbo-0613
star Featured Code
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
Input
16 510/M so'm
$1.30
Output
33 020/M so'm
$2.60
data_array 4K context
Arcee AI: Coder Large
arcee-ai/coder-large
Code
Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...
Input
8 255/M so'm
$0.65
Output
13 208/M so'm
$1.04
data_array 33K context
Cohere: North Mini Code (free)
cohere/north-mini-code:free
free_breakfast Free Code
North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...
Input
Free
Output
Free
data_array 256K context
Kwaipilot: KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2
Code
KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...
Input
4 953/M so'm
$0.39
Output
19 812/M so'm
$1.56
data_array 256K context
Mistral: Codestral 2508
mistralai/codestral-2508
Code
Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)
Input
4 953/M so'm
$0.39
Output
14 859/M so'm
$1.17
data_array 256K context
Morph: Morph V3 Fast
morph/morph-v3-fast
Code
Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update>...
Input
13 208/M so'm
$1.04
Output
19 812/M so'm
$1.56
data_array 82K context
Morph: Morph V3 Large
morph/morph-v3-large
Code
Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>...
Input
14 859/M so'm
$1.17
Output
31 369/M so'm
$2.47
data_array 262K context
Owl Alpha
openrouter/owl-alpha
free_breakfast Free Code
Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....
Input
Free
Output
Free
data_array 1,049K context
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct
Code
Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...
Input
1 156/M so'm
$0.09
Output
4 458/M so'm
$0.35
data_array 160K context
Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flash
Code
Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...
Input
3 219/M so'm
$0.25
Output
16 097/M so'm
$1.27
data_array 1,000K context
Qwen: Qwen3 Coder Next
qwen/qwen3-coder-next
Code
Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...
Input
1 816/M so'm
$0.14
Output
13 208/M so'm
$1.04
data_array 262K context
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
Code
Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...
Input
10 732/M so'm
$0.85
Output
53 658/M so'm
$4.23
data_array 1,000K context
Relace: Relace Apply 3
relace/relace-apply-3
Code
Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...
Input
14 034/M so'm
$1.11
Output
20 638/M so'm
$1.63
data_array 256K context