AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
star Featured Reasoning
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...
Input
4 953/M so'm
$0.39
Output
41 275/M so'm
$3.25
data_array 1,049K context
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
star Featured Reasoning
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Input
1 651/M so'm
$0.13
Output
6 604/M so'm
$0.52
data_array 1,049K context
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
star Featured Reasoning
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Input
1 651/M so'm
$0.13
Output
6 604/M so'm
$0.52
data_array 1,049K context
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
star Featured Reasoning
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Input
20 638/M so'm
$1.63
Output
165 100/M so'm
$13.00
data_array 1,049K context
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
star Featured Reasoning
Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...
Input
8 255/M so'm
$0.65
Output
49 530/M so'm
$3.90
data_array 1,049K context
Google: Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
star Featured Vision
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Input
4 128/M so'm
$0.33
Output
24 765/M so'm
$1.95
data_array 1,049K context
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
star Featured Vision
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...
Input
4 128/M so'm
$0.33
Output
24 765/M so'm
$1.95
data_array 1,049K context
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
star Featured Reasoning
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...
Input
33 020/M so'm
$2.60
Output
198 120/M so'm
$15.60
data_array 1,049K context
Google: Gemini 3.1 Pro Preview Custom Tools
google/gemini-3.1-pro-preview-customtools
star Featured Vision
Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...
Input
33 020/M so'm
$2.60
Output
198 120/M so'm
$15.60
data_array 1,049K context
Google: Gemini 3.5 Flash
google/gemini-3.5-flash
star Featured Reasoning
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...
Input
24 765/M so'm
$1.95
Output
148 590/M so'm
$11.70
data_array 1,049K context
Google: Gemma 2 27B
google/gemma-2-27b-it
star Featured
Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...
Input
10 732/M so'm
$0.85
Output
10 732/M so'm
$0.85
data_array 8K context
Google: Gemma 3 12B
google/gemma-3-12b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
826/M so'm
$0.07
Output
2 477/M so'm
$0.20
data_array 131K context
Google: Gemma 3 27B
google/gemma-3-27b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
1 321/M so'm
$0.10
Output
2 642/M so'm
$0.21
data_array 131K context
Google: Gemma 3 4B
google/gemma-3-4b-it
star Featured Reasoning
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Input
826/M so'm
$0.07
Output
1 651/M so'm
$0.13
data_array 131K context
Google: Gemma 3n 4B
google/gemma-3n-e4b-it
star Featured
Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...
Input
991/M so'm
$0.08
Output
1 981/M so'm
$0.16
data_array 33K context
Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
star Featured Vision
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Input
991/M so'm
$0.08
Output
5 448/M so'm
$0.43
data_array 262K context
Google: Gemma 4 26B A4B (free)
google/gemma-4-26b-a4b-it:free
free_breakfast Free star Featured Vision
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Input
Free
Output
Free
data_array 262K context
Google: Gemma 4 31B
google/gemma-4-31b-it
star Featured Reasoning
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Input
1 981/M so'm
$0.16
Output
5 779/M so'm
$0.46
data_array 262K context
Google: Gemma 4 31B (free)
google/gemma-4-31b-it:free
free_breakfast Free star Featured Reasoning
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Input
Free
Output
Free
data_array 262K context
Google: Lyria 3 Clip Preview
google/lyria-3-clip-preview
free_breakfast Free star Featured Vision
30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...
Input
Free
Output
Free
data_array 1,049K context
Google: Lyria 3 Pro Preview
google/lyria-3-pro-preview
free_breakfast Free star Featured Vision
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...
Input
Free
Output
Free
data_array 1,049K context
Google: Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-image
star Featured Vision
Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation,...
Input
4 953/M so'm
$0.39
Output
41 275/M so'm
$3.25
data_array 33K context