AI Models - CloudAPI

MythoMax 13B

gryphe/mythomax-l2-13b

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

Input

991 so'm per 1M tokens

$0.08

Output

991 so'm per 1M tokens

$0.08

4K context

Nex AGI: Nex-N2-Pro

nex-agi/nex-n2-pro

Vision

Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total. Built on the Qwen3.5 architecture, it accepts text and image input and produces...

Input

8,255 so'm per 1M tokens

$0.65

Output

41,275 so'm per 1M tokens

$3.25

262K context

Nous: Hermes 3 405B Instruct

nousresearch/hermes-3-llama-3.1-405b

Reasoning

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

Input

16,510 so'm per 1M tokens

$1.30

Output

16,510 so'm per 1M tokens

$1.30

131K context

Nous: Hermes 3 405B Instruct (free)

nousresearch/hermes-3-llama-3.1-405b:free

Free, Reasoning

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

Input

Free

Output

Free

131K context

Nous: Hermes 3 70B Instruct

nousresearch/hermes-3-llama-3.1-70b

Reasoning

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

Input

11,557 so'm per 1M tokens

$0.91

Output

11,557 so'm per 1M tokens

$0.91

131K context

Nous: Hermes 4 405B

nousresearch/hermes-4-405b

Reasoning

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

Input

16,510 so'm per 1M tokens

$1.30

Output

49,530 so'm per 1M tokens

$3.90

131K context

Nous: Hermes 4 70B

nousresearch/hermes-4-70b

Reasoning

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

Input

2,146 so'm per 1M tokens

$0.17

Output

6,604 so'm per 1M tokens

$0.52

131K context
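
The listed rates can be turned into a per-request estimate with simple arithmetic. The sketch below assumes that "/M" in the listing means "per 1 million tokens" and that input and output tokens are billed separately at their own rates. The function name `estimate_cost_usd` and the token counts are illustrative only and are not part of any CloudAPI interface.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate one request's cost from per-1M-token prices (hypothetical helper)."""
    # Prices are passed as strings so Decimal keeps them exact.
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost


# Hermes 4 70B rates from the listing: $0.17 input, $0.52 output per 1M tokens.
# The token counts (12,000 in, 3,000 out) are made up for illustration.
cost = estimate_cost_usd(12_000, 3_000, "0.17", "0.52")
print(f"${cost:.5f}")  # prints "$0.00360"
```

Passing the so'm rates instead of the dollar rates gives the same estimate in so'm. Actual billing may differ, for example if a provider counts tokens differently or adds per-request fees.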

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia/llama-3.3-nemotron-super-49b-v1.5

Reasoning

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Input

6,604 so'm per 1M tokens

$0.52

Output

6,604 so'm per 1M tokens

$0.52

131K context

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

Input

826 so'm per 1M tokens

$0.07

Output

3,302 so'm per 1M tokens

$0.26

262K context

NVIDIA: Nemotron 3 Nano 30B A3B (free)

nvidia/nemotron-3-nano-30b-a3b:free

Free

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

Input

Free

Output

Free

256K context

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Free, Reasoning

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

Input

Free

Output

Free

256K context

NVIDIA: Nemotron 3 Super

nvidia/nemotron-3-super-120b-a12b

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Input

1,486 so'm per 1M tokens

$0.12

Output

7,430 so'm per 1M tokens

$0.59

1,000K context

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

Free

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Input

Free

Output

Free

1,000K context

NVIDIA: Nemotron 3 Ultra

nvidia/nemotron-3-ultra-550b-a55b

Reasoning

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Input

8,255 so'm per 1M tokens

$0.65

Output

36,322 so'm per 1M tokens

$2.86

1,000K context

NVIDIA: Nemotron 3 Ultra (free)

nvidia/nemotron-3-ultra-550b-a55b:free

Free, Reasoning

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Input

Free

Output

Free

1,000K context

NVIDIA: Nemotron 3.5 Content Safety (free)

nvidia/nemotron-3.5-content-safety:free

Free, Vision

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

Input

Free

Output

Free

128K context

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia/nemotron-nano-12b-v2-vl:free

Free, Reasoning

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

Input

Free

Output

Free

128K context

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia/nemotron-nano-9b-v2:free

Free, Reasoning

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

Input

Free

Output

Free

128K context

OpenAI GPT Latest

~openai/gpt-latest

Vision

This model always redirects to the latest model in the OpenAI GPT family.

Input

82,550 so'm per 1M tokens

$6.50

Output

495,300 so'm per 1M tokens

$39.00

1,050K context

OpenAI GPT Mini Latest

~openai/gpt-mini-latest

Vision

This model always redirects to the latest model in the OpenAI GPT Mini family.

Input

12,383 so'm per 1M tokens

$0.98

Output

74,295 so'm per 1M tokens

$5.85

400K context

Owl Alpha

openrouter/owl-alpha

Free, Code

Owl Alpha is a high-performance foundation model designed for agentic workloads. It natively supports tool use and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....

Input

Free

Output

Free

1,049K context

Perceptron: Perceptron Mk1

perceptron/perceptron-mk1

Reasoning

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning. It accepts image and video inputs paired with natural language queries, and produces detailed visual understanding...

Input

2,477 so'm per 1M tokens

$0.20

Output

24,765 so'm per 1M tokens

$1.95

33K context

Perplexity: Sonar

perplexity/sonar

Vision

Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

Input

16,510 so'm per 1M tokens

$1.30

Output

16,510 so'm per 1M tokens

$1.30

127K context

Perplexity: Sonar Deep Research

perplexity/sonar-deep-research

Reasoning

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...

Input

33,020 so'm per 1M tokens

$2.60

Output

132,080 so'm per 1M tokens

$10.40

128K context