AI Models

Use 300+ models from OpenAI, Anthropic, Google and others through a single API

memory 346 models
free_breakfast 30 free
business 58 providers
Z.ai: GLM 4.5
z-ai/glm-4.5
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...
Input
9 906/M so'm
$0.78
Output
36 322/M so'm
$2.86
data_array 131K context
Z.ai: GLM 4.5 Air
z-ai/glm-4.5-air
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Input
2 146/M so'm
$0.17
Output
14 034/M so'm
$1.11
data_array 131K context
Z.ai: GLM 4.5V
z-ai/glm-4.5v
Vision
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...
Input
9 906/M so'm
$0.78
Output
29 718/M so'm
$2.34
data_array 66K context
Z.ai: GLM 4.6
z-ai/glm-4.6
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...
Input
7 099/M so'm
$0.56
Output
28 727/M so'm
$2.26
data_array 203K context
Z.ai: GLM 4.6V
z-ai/glm-4.6v
Reasoning
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
Input
4 953/M so'm
$0.39
Output
14 859/M so'm
$1.17
data_array 131K context
Z.ai: GLM 4.7
z-ai/glm-4.7
Reasoning
GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...
Input
6 604/M so'm
$0.52
Output
28 893/M so'm
$2.28
data_array 203K context
Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...
Input
991/M so'm
$0.08
Output
6 604/M so'm
$0.52
data_array 203K context
Z.ai: GLM 5
z-ai/glm-5
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...
Input
9 906/M so'm
$0.78
Output
31 699/M so'm
$2.50
data_array 203K context
Z.ai: GLM 5 Turbo
z-ai/glm-5-turbo
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...
Input
19 812/M so'm
$1.56
Output
66 040/M so'm
$5.20
data_array 262K context
Z.ai: GLM 5.1
z-ai/glm-5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...
Input
16 180/M so'm
$1.27
Output
50 851/M so'm
$4.00
data_array 203K context
Z.ai: GLM 5.2
z-ai/glm-5.2
Reasoning
GLM 5.2 is a large-scale reasoning model from Z.ai. It supports text input and output with a 1M-token context window, and is suited for long-horizon agent workflows, project-level software engineering,...
Input
16 180/M so'm
$1.27
Output
50 851/M so'm
$4.00
data_array 1,049K context
Z.ai: GLM 5V Turbo
z-ai/glm-5v-turbo
Vision
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Input
19 812/M so'm
$1.56
Output
66 040/M so'm
$5.20
data_array 203K context