AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
star Featured Reasoning
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...
Input
4 953/M so'm
$0.39
Output
41 275/M so'm
$3.25
data_array 1,049K context
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
star Featured Reasoning
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Input
1 651/M so'm
$0.13
Output
6 604/M so'm
$0.52
data_array 1,049K context
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
star Featured Reasoning
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Input
1 651/M so'm
$0.13
Output
6 604/M so'm
$0.52
data_array 1,049K context
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
star Featured Reasoning
Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...
Input
8 255/M so'm
$0.65
Output
49 530/M so'm
$3.90
data_array 1,049K context
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
star Featured Reasoning
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...
Input
33 020/M so'm
$2.60
Output
198 120/M so'm
$15.60
data_array 1,049K context
Google: Gemini 3.5 Flash
google/gemini-3.5-flash
star Featured Reasoning
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...
Input
24 765/M so'm
$1.95
Output
148 590/M so'm
$11.70
data_array 1,049K context
Google: Gemma 3 12B
google/gemma-3-12b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
826/M so'm
$0.07
Output
2 477/M so'm
$0.20
data_array 131K context
Google: Gemma 3 27B
google/gemma-3-27b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
1 321/M so'm
$0.10
Output
2 642/M so'm
$0.21
data_array 131K context
Google: Gemma 3 4B
google/gemma-3-4b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
826/M so'm
$0.07
Output
1 651/M so'm
$0.13
data_array 131K context
Google: Gemma 4 31B
google/gemma-4-31b-it
star Featured Reasoning
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Input
1 981/M so'm
$0.16
Output
5 779/M so'm
$0.46
data_array 262K context
Google: Gemma 4 31B (free)
google/gemma-4-31b-it:free
free_breakfast Free star Featured Reasoning
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Input
Free
Output
Free
data_array 262K context
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
google/gemini-3-pro-image-preview
star Featured Reasoning
Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...
Input
33 020/M so'm
$2.60
Output
198 120/M so'm
$15.60
data_array 66K context
Google: Nano Banana Pro (Gemini 3 Pro Image)
google/gemini-3-pro-image
star Featured Reasoning
Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...
Input
33 020/M so'm
$2.60
Output
198 120/M so'm
$15.60
data_array 66K context