AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
OpenAI: gpt-oss-120b (free)
openai/gpt-oss-120b:free
free_breakfast Free star Featured Reasoning
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
Input
Free
Output
Free
data_array 131K context
OpenAI: gpt-oss-safeguard-20b
openai/gpt-oss-safeguard-20b
star Featured Reasoning
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...
Input
1 238/M so'm
$0.10
Output
4 953/M so'm
$0.39
data_array 131K context
OpenAI: o1
openai/o1
star Featured Reasoning
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...
Input
247 650/M so'm
$19.50
Output
990 600/M so'm
$78.00
data_array 200K context
OpenAI: o1-pro
openai/o1-pro
star Featured Reasoning
The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...
Input
2 476 500/M so'm
$195.00
Output
9 906 000/M so'm
$780.00
data_array 200K context
OpenAI: o3
openai/o3
star Featured Reasoning
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....
Input
33 020/M so'm
$2.60
Output
132 080/M so'm
$10.40
data_array 200K context
OpenAI: o3 Deep Research
openai/o3-deep-research
star Featured Reasoning
o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.
Input
165 100/M so'm
$13.00
Output
660 400/M so'm
$52.00
data_array 200K context
OpenAI: o3 Mini
openai/o3-mini
star Featured Reasoning
OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...
Input
18 161/M so'm
$1.43
Output
72 644/M so'm
$5.72
data_array 200K context
OpenAI: o3 Mini High
openai/o3-mini-high
star Featured Reasoning
OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and...
Input
18 161/M so'm
$1.43
Output
72 644/M so'm
$5.72
data_array 200K context
OpenAI: o3 Pro
openai/o3-pro
star Featured Reasoning
The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...
Input
330 200/M so'm
$26.00
Output
1 320 800/M so'm
$104.00
data_array 200K context
OpenAI: o4 Mini
openai/o4-mini
star Featured Reasoning
OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...
Input
18 161/M so'm
$1.43
Output
72 644/M so'm
$5.72
data_array 200K context
OpenAI: o4 Mini High
openai/o4-mini-high
star Featured Reasoning
OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...
Input
18 161/M so'm
$1.43
Output
72 644/M so'm
$5.72
data_array 200K context
DeepSeek: R1
deepseek/deepseek-r1
star Featured Reasoning
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....
Input
11 557/M so'm
$0.91
Output
41 275/M so'm
$3.25
data_array 164K context
AionLabs: Aion-1.0
aion-labs/aion-1.0
Reasoning
Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...
Input
66 040/M so'm
$5.20
Output
132 080/M so'm
$10.40
data_array 131K context
AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-mini
Reasoning
Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...
Input
11 557/M so'm
$0.91
Output
23 114/M so'm
$1.82
data_array 131K context
AllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think
Reasoning
Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...
Input
2 477/M so'm
$0.20
Output
8 255/M so'm
$0.65
data_array 66K context
Amazon: Nova 2 Lite
amazon/nova-2-lite-v1
Reasoning
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...
Input
4 953/M so'm
$0.39
Output
41 275/M so'm
$3.25
data_array 1,000K context
Amazon: Nova Premier 1.0
amazon/nova-premier-v1
Reasoning
Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.
Input
41 275/M so'm
$3.25
Output
206 375/M so'm
$16.25
data_array 1,000K context
Arcee AI: Trinity Large Thinking
arcee-ai/trinity-large-thinking
Reasoning
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...
Input
4 128/M so'm
$0.33
Output
13 208/M so'm
$1.04
data_array 262K context
Arcee AI: Trinity Mini
arcee-ai/trinity-mini
Reasoning
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...
Input
743/M so'm
$0.06
Output
2 477/M so'm
$0.20
data_array 131K context
Arcee AI: Virtuoso Large
arcee-ai/virtuoso-large
Reasoning
Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...
Input
12 383/M so'm
$0.98
Output
19 812/M so'm
$1.56
data_array 131K context
ByteDance Seed: Seed-2.0-Mini
bytedance-seed/seed-2.0-mini
Reasoning
Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...
Input
1 651/M so'm
$0.13
Output
6 604/M so'm
$0.52
data_array 262K context
Cohere: Command R (08-2024)
cohere/command-r-08-2024
Reasoning
command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...
Input
2 477/M so'm
$0.20
Output
9 906/M so'm
$0.78
data_array 128K context
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024
Reasoning
Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...
Input
619/M so'm
$0.05
Output
2 477/M so'm
$0.20
data_array 128K context
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct
Reasoning
Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...
Input
2 477/M so'm
$0.20
Output
2 477/M so'm
$0.20
data_array 33K context