AI Модели

Используйте 300+ моделей от OpenAI, Anthropic, Google и других через единый API

memory 346 моделей
free_breakfast 30 бесплатных
business 58 провайдеров
AI21: Jamba Large 1.7
ai21/jamba-large-1.7
Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...
Ввод
33 020/M so'm
$2.60
Вывод
132 080/M so'm
$10.40
data_array 256K context
AionLabs: Aion-2.0
aion-labs/aion-2.0
Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is particularly strong at introducing tension, crises, and conflict into stories, making narratives feel more engaging....
Ввод
13 208/M so'm
$1.04
Вывод
26 416/M so'm
$2.08
data_array 131K context
AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b
Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...
Ввод
13 208/M so'm
$1.04
Вывод
26 416/M so'm
$2.08
data_array 33K context
Amazon: Nova Micro 1.0
amazon/nova-micro-v1
Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...
Ввод
578/M so'm
$0.05
Вывод
2 311/M so'm
$0.18
data_array 128K context
Cohere: Command A
cohere/command-a
Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...
Ввод
41 275/M so'm
$3.25
Вывод
165 100/M so'm
$13.00
data_array 256K context
Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024
command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...
Ввод
41 275/M so'm
$3.25
Вывод
165 100/M so'm
$13.00
data_array 128K context
Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671b
Cogito v2.1 671B MoE represents one of the strongest open models globally, matching performance of frontier closed and open models. This model is trained using self play with reinforcement learning...
Ввод
20 638/M so'm
$1.63
Вывод
20 638/M so'm
$1.63
data_array 128K context
IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro
Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...
Ввод
281/M so'm
$0.02
Вывод
1 849/M so'm
$0.15
data_array 131K context
IBM: Granite 4.1 8B
ibm-granite/granite-4.1-8b
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...
Ввод
826/M so'm
$0.07
Вывод
1 651/M so'm
$0.13
data_array 131K context
inclusionAI: Ling-2.6-1T
inclusionai/ling-2.6-1t
Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...
Ввод
1 238/M so'm
$0.10
Вывод
10 319/M so'm
$0.81
data_array 262K context
inclusionAI: Ling-2.6-flash
inclusionai/ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Ввод
165/M so'm
$0.01
Вывод
495/M so'm
$0.04
data_array 262K context
inclusionAI: Ring-2.6-1T
inclusionai/ring-2.6-1t
Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...
Ввод
1 238/M so'm
$0.10
Вывод
10 319/M so'm
$0.81
data_array 262K context
Inflection: Inflection 3 Pi
inflection/inflection-3-pi
Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like customer support and roleplay. Pi...
Ввод
41 275/M so'm
$3.25
Вывод
165 100/M so'm
$13.00
data_array 8K context
Inflection: Inflection 3 Productivity
inflection/inflection-3-productivity
Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional...
Ввод
41 275/M so'm
$3.25
Вывод
165 100/M so'm
$13.00
data_array 8K context
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b
LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...
Ввод
495/M so'm
$0.04
Вывод
1 981/M so'm
$0.16
data_array 128K context
LiquidAI: LFM2.5-1.2B-Instruct (free)
liquid/lfm-2.5-1.2b-instruct:free
free_breakfast Free
LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast on-device AI. It delivers strong chat quality in a 1.2B parameter footprint, with efficient edge inference and broad runtime support.
Ввод
Free
Вывод
Free
data_array 33K context
Magnum v4 72B
anthracite-org/magnum-v4-72b
This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).
Ввод
49 530/M so'm
$3.90
Вывод
82 550/M so'm
$6.50
data_array 33K context
Mancer: Weaver (alpha)
mancer/weaver
An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.
Ввод
12 383/M so'm
$0.98
Вывод
16 510/M so'm
$1.30
data_array 8K context
MiniMax: MiniMax M2-her
minimax/minimax-m2-her
MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-driven chat, and expressive multi-turn conversations. Designed to stay consistent in tone and personality, it supports rich message...
Ввод
4 953/M so'm
$0.39
Вывод
19 812/M so'm
$1.56
data_array 66K context
MiniMax: MiniMax M2.1
minimax/minimax-m2.1
MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...
Ввод
4 788/M so'm
$0.38
Вывод
15 685/M so'm
$1.24
data_array 205K context
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...
Ввод
2 477/M so'm
$0.20
Вывод
14 859/M so'm
$1.17
data_array 205K context
MiniMax: MiniMax M2.7
minimax/minimax-m2.7
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...
Ввод
4 128/M so'm
$0.33
Вывод
16 510/M so'm
$1.30
data_array 205K context
Mistral: Devstral 2 2512
mistralai/devstral-2512
Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...
Ввод
6 604/M so'm
$0.52
Вывод
33 020/M so'm
$2.60
data_array 262K context
Mistral: Mistral Nemo
mistralai/mistral-nemo
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...
Ввод
330/M so'm
$0.03
Вывод
495/M so'm
$0.04
data_array 131K context