AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models

free_breakfast 30 free

business 58 providers

apps All 346 psychology Reasoning 136 visibility Vision 98 chat Chat 97 code Code 14 circle models.category.moderation 1

All Openai · 64 Qwen · 49 Google · 30 Mistralai · 19 Anthropic · 18 Meta-llama · 13 Z-ai · 12 Deepseek · 12 Nvidia · 11 Minimax · 8 Moonshotai · 6 Cohere · 5 Nousresearch · 5 Perplexity · 5 Amazon · 5

OpenAI: GPT-3.5 Turbo

openai/gpt-3.5-turbo

star Featured Code

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

data_array 16K context

OpenAI: GPT-3.5 Turbo (older v0613)

openai/gpt-3.5-turbo-0613

star Featured Code

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

data_array 4K context

Arcee AI: Coder Large

arcee-ai/coder-large

Code

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...

data_array 33K context

Cohere: North Mini Code (free)

cohere/north-mini-code:free

free_breakfast Free Code

North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...

data_array 256K context

Kwaipilot: KAT-Coder-Pro V2

kwaipilot/kat-coder-pro-v2

Code

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...

data_array 256K context

Mistral: Codestral 2508

mistralai/codestral-2508

Code

Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

data_array 256K context

Morph: Morph V3 Fast

morph/morph-v3-fast

Code

Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update>...

data_array 82K context

Morph: Morph V3 Large

morph/morph-v3-large

Code

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>...

data_array 262K context

Owl Alpha

openrouter/owl-alpha

free_breakfast Free Code

Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....

data_array 1,049K context

Qwen: Qwen3 Coder 30B A3B Instruct

qwen/qwen3-coder-30b-a3b-instruct

Code

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

data_array 160K context

Qwen: Qwen3 Coder Flash

qwen/qwen3-coder-flash

Code

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

data_array 1,000K context

Qwen: Qwen3 Coder Next

qwen/qwen3-coder-next

Code

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...

data_array 262K context

Qwen: Qwen3 Coder Plus

qwen/qwen3-coder-plus

Code

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

data_array 1,000K context

Relace: Relace Apply 3

relace/relace-apply-3

Code

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...

data_array 256K context