AI Models — CloudAPI

Google: Gemini 2.5 Flash

google/gemini-2.5-flash

Featured, Reasoning

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Input

4 953/M so'm

$0.39

Output

41 275/M so'm

$3.25

1,049K context
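As a rough illustration of how the listed rates turn into a per-request cost, the sketch below applies the USD prices shown for google/gemini-2.5-flash. It assumes that "/M" means "per one million tokens" and that input and output tokens are billed separately at those rates; the function name `estimate_cost_usd` and the token counts are hypothetical, and actual billing (for example, how "thinking" tokens are counted) may differ.

```python
from decimal import Decimal

# Listed USD prices for google/gemini-2.5-flash.
# Assumption: "/M" means "per 1,000,000 tokens".
INPUT_USD_PER_M = Decimal("0.39")
OUTPUT_USD_PER_M = Decimal("3.25")


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_per_m: Decimal = INPUT_USD_PER_M,
    output_per_m: Decimal = OUTPUT_USD_PER_M,
) -> Decimal:
    """Estimate the USD cost of one request at per-million-token rates."""
    million = Decimal(1_000_000)
    total = Decimal(input_tokens) * input_per_m + Decimal(output_tokens) * output_per_m
    return total / million


# Hypothetical request: 12,000 prompt tokens and 1,500 completion tokens.
print(estimate_cost_usd(12_000, 1_500))
```

Passing other `input_per_m` and `output_per_m` values gives the same estimate for any other model in the list. `Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise.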

Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Featured, Reasoning

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Input

1 651/M so'm

$0.13

Output

6 604/M so'm

$0.52

1,049K context

Google: Gemini 2.5 Flash Lite Preview 09-2025

google/gemini-2.5-flash-lite-preview-09-2025

Featured, Reasoning

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Input

1 651/M so'm

$0.13

Output

6 604/M so'm

$0.52

1,049K context

Google: Gemini 2.5 Pro

google/gemini-2.5-pro

Featured, Reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Input

20 638/M so'm

$1.63

Output

165 100/M so'm

$13.00

1,049K context

Google: Gemini 2.5 Pro Preview 05-06

google/gemini-2.5-pro-preview-05-06

Featured, Reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Input

20 638/M so'm

$1.63

Output

165 100/M so'm

$13.00

1,049K context

Google: Gemini 2.5 Pro Preview 06-05

google/gemini-2.5-pro-preview

Featured, Reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Input

20 638/M so'm

$1.63

Output

165 100/M so'm

$13.00

1,049K context

Google: Gemini 3 Flash Preview

google/gemini-3-flash-preview

Featured, Reasoning

Gemini 3 Flash Preview is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro-level reasoning and tool...

Input

8 255/M so'm

$0.65

Output

49 530/M so'm

$3.90

1,049K context

Google: Gemini 3.1 Pro Preview

google/gemini-3.1-pro-preview

Featured, Reasoning

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Input

33 020/M so'm

$2.60

Output

198 120/M so'm

$15.60

1,049K context

Google: Gemini 3.5 Flash

google/gemini-3.5-flash

Featured, Reasoning

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Input

24 765/M so'm

$1.95

Output

148 590/M so'm

$11.70

1,049K context

Google: Gemma 3 12B

google/gemma-3-12b-it

Featured, Reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Input

826/M so'm

$0.07

Output

2 477/M so'm

$0.20

131K context

Google: Gemma 3 27B

google/gemma-3-27b-it

Featured, Reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Input

1 321/M so'm

$0.10

Output

2 642/M so'm

$0.21

131K context

Google: Gemma 3 4B

google/gemma-3-4b-it

Featured, Reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Input

826/M so'm

$0.07

Output

1 651/M so'm

$0.13

131K context

Google: Gemma 4 31B

google/gemma-4-31b-it

Featured, Reasoning

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. It features a 256K-token context window, configurable thinking/reasoning mode, native function...

Input

1 981/M so'm

$0.16

Output

5 779/M so'm

$0.46

262K context

Google: Gemma 4 31B (free)

google/gemma-4-31b-it:free

Free, Featured, Reasoning

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. It features a 256K-token context window, configurable thinking/reasoning mode, native function...

Input

Free

Output

Free

262K context

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

google/gemini-3-pro-image-preview

Featured, Reasoning

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Input

33 020/M so'm

$2.60

Output

198 120/M so'm

$15.60

66K context

Google: Nano Banana Pro (Gemini 3 Pro Image)

google/gemini-3-pro-image

Featured, Reasoning

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Input

33 020/M so'm

$2.60

Output

198 120/M so'm

$15.60

66K context