AI Models

Use 300+ models from OpenAI, Anthropic, Google, and others through a single API

346 models
30 free
58 providers
MythoMax 13B
gryphe/mythomax-l2-13b
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
Input
991/M so'm
$0.08
Output
991/M so'm
$0.08
4K context
Nex AGI: Nex-N2-Pro
nex-agi/nex-n2-pro
Vision
Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total. Built on the Qwen3.5 architecture, it accepts text and image input and produces...
Input
8 255/M so'm
$0.65
Output
41 275/M so'm
$3.25
262K context
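Each entry lists separate input and output prices, so the cost of a request depends on both token counts. Below is a minimal sketch of that arithmetic, assuming "/M" means "per one million tokens" (the listing does not spell this out). The function name `estimate_cost_usd` and the token counts are hypothetical; the prices are the USD figures from the Nex-N2-Pro entry above.

```python
from decimal import Decimal

# Assumption: "/M" in the listing means "per 1,000,000 tokens".
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: Decimal,
    output_price_per_m: Decimal,
) -> Decimal:
    """Estimate the USD cost of one request from per-million-token prices."""
    input_cost = Decimal(input_tokens) * input_price_per_m / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * output_price_per_m / TOKENS_PER_MILLION
    return input_cost + output_cost


# Prices from the Nex-N2-Pro entry: $0.65 input, $3.25 output.
# The token counts are made-up illustration values.
cost = estimate_cost_usd(200_000, 50_000, Decimal("0.65"), Decimal("3.25"))
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of `float` so that small per-token prices do not pick up binary rounding error. The same function works for the so'm column if the so'm prices are passed in instead.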
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b
Reasoning
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
Input
16 510/M so'm
$1.30
Output
16 510/M so'm
$1.30
131K context
Nous: Hermes 3 405B Instruct (free)
nousresearch/hermes-3-llama-3.1-405b:free
Free Reasoning
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
Input
Free
Output
Free
131K context
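In this listing, the free variant of a model reuses the paid model's identifier with a `:free` suffix (compare `nousresearch/hermes-3-llama-3.1-405b` and `nousresearch/hermes-3-llama-3.1-405b:free`). The sketch below splits such an identifier into its parts. The `provider/model[:variant]` layout is inferred from the entries shown here, not from any documented specification, and the names `ModelId` and `split_model_id` are hypothetical.

```python
from typing import NamedTuple, Optional


class ModelId(NamedTuple):
    provider: str
    model: str
    variant: Optional[str]  # e.g. "free"; None when there is no suffix


def split_model_id(model_id: str) -> ModelId:
    """Split 'provider/model[:variant]' into its parts.

    Assumption: the layout is inferred from the identifiers in this listing.
    """
    base, _, variant = model_id.partition(":")
    provider, _, model = base.partition("/")
    return ModelId(provider, model, variant or None)


paid = split_model_id("nousresearch/hermes-3-llama-3.1-405b")
free = split_model_id("nousresearch/hermes-3-llama-3.1-405b:free")

# Both identifiers point at the same provider and model; only the variant differs.
print(paid.model == free.model, paid.variant, free.variant)
```

The suffix is not a reliable test for whether a model is free: `openrouter/owl-alpha` is listed as free without one, so the price fields remain the authoritative source.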
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b
Reasoning
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
Input
11 557/M so'm
$0.91
Output
11 557/M so'm
$0.91
131K context
Nous: Hermes 4 405B
nousresearch/hermes-4-405b
Reasoning
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...
Input
16 510/M so'm
$1.30
Output
49 530/M so'm
$3.90
131K context
Nous: Hermes 4 70B
nousresearch/hermes-4-70b
Reasoning
Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...
Input
2 146/M so'm
$0.17
Output
6 604/M so'm
$0.52
131K context
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
Reasoning
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Input
6 604/M so'm
$0.52
Output
6 604/M so'm
$0.52
131K context
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Input
826/M so'm
$0.07
Output
3 302/M so'm
$0.26
262K context
NVIDIA: Nemotron 3 Nano 30B A3B (free)
nvidia/nemotron-3-nano-30b-a3b:free
Free
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Input
Free
Output
Free
256K context
NVIDIA: Nemotron 3 Nano Omni (free)
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
Free Reasoning
NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...
Input
Free
Output
Free
256K context
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Input
1 486/M so'm
$0.12
Output
7 430/M so'm
$0.59
1,000K context
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free
Free
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Input
Free
Output
Free
1,000K context
NVIDIA: Nemotron 3 Ultra
nvidia/nemotron-3-ultra-550b-a55b
Reasoning
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Input
8 255/M so'm
$0.65
Output
36 322/M so'm
$2.86
1,000K context
NVIDIA: Nemotron 3 Ultra (free)
nvidia/nemotron-3-ultra-550b-a55b:free
Free Reasoning
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Input
Free
Output
Free
1,000K context
NVIDIA: Nemotron 3.5 Content Safety (free)
nvidia/nemotron-3.5-content-safety:free
Free Vision
NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...
Input
Free
Output
Free
128K context
NVIDIA: Nemotron Nano 12B 2 VL (free)
nvidia/nemotron-nano-12b-v2-vl:free
Free Reasoning
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
Input
Free
Output
Free
128K context
NVIDIA: Nemotron Nano 9B V2 (free)
nvidia/nemotron-nano-9b-v2:free
Free Reasoning
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Input
Free
Output
Free
128K context
OpenAI GPT Latest
~openai/gpt-latest
Vision
This model always redirects to the latest model in the OpenAI GPT family.
Input
82 550/M so'm
$6.50
Output
495 300/M so'm
$39.00
1,050K context
OpenAI GPT Mini Latest
~openai/gpt-mini-latest
Vision
This model always redirects to the latest model in the OpenAI GPT Mini family.
Input
12 383/M so'm
$0.98
Output
74 295/M so'm
$5.85
400K context
Owl Alpha
openrouter/owl-alpha
Free Code
Owl Alpha is a high-performance foundation model designed for agentic workloads. It natively supports tool use and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution...
Input
Free
Output
Free
1,049K context
Perceptron: Perceptron Mk1
perceptron/perceptron-mk1
Reasoning
Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning. It accepts image and video inputs paired with natural language queries, and produces detailed visual understanding...
Input
2 477/M so'm
$0.20
Output
24 765/M so'm
$1.95
33K context
Perplexity: Sonar
perplexity/sonar
Vision
Sonar is lightweight, affordable, fast, and simple to use, and now features citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...
Input
16 510/M so'm
$1.30
Output
16 510/M so'm
$1.30
127K context
Perplexity: Sonar Deep Research
perplexity/sonar-deep-research
Reasoning
Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...
Input
33 020/M so'm
$2.60
Output
132 080/M so'm
$10.40
128K context