AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

346 models
30 free
58 providers
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2
Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...
Input
9 411/M so'm
$0.74
Output
37 973/M so'm
$2.99
131K context
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905
Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...
Input
9 906/M so'm
$0.78
Output
41 275/M so'm
$3.25
262K context
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
Reasoning
Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...
Input
9 906/M so'm
$0.78
Output
41 275/M so'm
$3.25
262K context
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
Vision
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed...
Input
6 191/M so'm
$0.49
Output
33 433/M so'm
$2.63
262K context
MoonshotAI: Kimi K2.6
moonshotai/kimi-k2.6
Vision
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...
Input
10 897/M so'm
$0.86
Output
56 299/M so'm
$4.43
262K context
MoonshotAI: Kimi K2.7 Code
moonshotai/kimi-k2.7-code
Vision
Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...
Input
10 104/M so'm
$0.80
Output
50 669/M so'm
$3.99
262K context
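
The input and output prices listed above can be turned into a rough per-request cost estimate. The sketch below assumes that "/M" means "per one million tokens" and that input and output tokens are billed separately at the listed USD rates; the listing itself does not confirm either point. The `PRICES_USD_PER_M` table and the `estimate_cost_usd` function are hypothetical names introduced for illustration and are not part of any API.

```python
from decimal import Decimal

# USD prices copied from the listing above as (input, output).
# Assumption: each price is per 1,000,000 tokens.
PRICES_USD_PER_M = {
    "moonshotai/kimi-k2": (Decimal("0.74"), Decimal("2.99")),
    "moonshotai/kimi-k2-0905": (Decimal("0.78"), Decimal("3.25")),
    "moonshotai/kimi-k2-thinking": (Decimal("0.78"), Decimal("3.25")),
    "moonshotai/kimi-k2.5": (Decimal("0.49"), Decimal("2.63")),
    "moonshotai/kimi-k2.6": (Decimal("0.86"), Decimal("4.43")),
    "moonshotai/kimi-k2.7-code": (Decimal("0.80"), Decimal("3.99")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost_usd(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Raises KeyError for model IDs that are not in the table.
    input_price, output_price = PRICES_USD_PER_M[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 10,000-token reply on Kimi K2.5.
    print(estimate_cost_usd("moonshotai/kimi-k2.5", 200_000, 10_000))
```

`Decimal` is used instead of `float` so that the small per-token amounts add up without binary rounding error. Actual billing may differ, for example if a provider rounds per request or prices cached tokens differently.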