35 models · 7 families · one account

The full model stack, one key

From frontier GPT and Claude to Gemini, DeepSeek, GLM, Kimi, Qwen and more — every model below is live and reachable from a single OpenAI-compatible account and key.

View pricing View documentation

Context windowAdjustable reasoning effortImage generation

GPT

OpenAI-compatible GPT models for demanding coding, research, and image generation.

4 models

GPT-5.5

gpt-5.5

Frontier GPT model for demanding coding, research, and long problem-solving.

1.1M contextReasoning

GPT-5.4

gpt-5.4

Strong all-around GPT model with wide headroom for heavy coding sessions.

1.1M contextReasoning

GPT-5.4 Mini

gpt-5.4-mini

Smaller, faster GPT model for quick iterations and lighter tasks.

400K contextReasoning

GPT Image 2

gpt-image-2

Image generation for OpenAI-compatible image routes, editors, and automations.

Image

Claude

Anthropic Claude models for writing, analysis, and long structured work.

5 models

Claude Opus 4.8

claude-opus-4-8

Top-tier Claude model for demanding writing, analysis, and long structured tasks.

200K context

Claude Opus 4.7

claude-opus-4-7

High-capability Claude Opus model for complex reasoning and long-form work.

200K context

Claude Opus 4.6

claude-opus-4-6

Capable Claude Opus model for deep analysis and structured generation.

200K context

Claude Sonnet 4.6

claude-sonnet-4-6

Balanced Claude model for fast, high-quality everyday work.

200K context

Claude Haiku 4.5

claude-haiku-4-5

Fast, lightweight Claude model for quick responses at high volume.

200K context

Gemini

Google-family models tuned for fast multimodal and agentic workflows.

7 models

Gemini 2.5 Pro

gemini-2.5-pro

Flagship Gemini model for multimodal coding and reasoning.

1M context

Gemini 2.5 Flash

gemini-2.5-flash

Balanced Gemini model for fast multimodal and agentic work.

1M context

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite

Lowest-cost Gemini model for lightweight loops and tooling.

1M context

Gemini 3 Pro Preview

gemini-3-pro-preview

Preview Gemini model for heavier multimodal and coding sessions.

1M context

Gemini 3.1 Pro Preview

gemini-3.1-pro-preview

Newer Gemini Pro preview with stronger agentic coding.

1M context

Gemini 3 Flash Preview

gemini-3-flash-preview

Faster Gemini preview for responsive, high-volume tasks.

1M context

Gemini 3.5 Flash

gemini-3.5-flash

Fast Gemini 3.5 lane for low-latency multimodal requests.

1M context

DeepSeek

Large-context DeepSeek models for heavier reasoning at low cost.

4 models

DeepSeek V4 Flash

deepseek-v4-flash

Lowest-cost, large-context DeepSeek model for fast iterations.

1M context

DeepSeek V4 Pro

deepseek-v4-pro

Flagship large-context DeepSeek model for heavier reasoning.

1M context

DeepSeek V4 Pro Lightning

deepseek-v4-pro-lightning

High-speed DeepSeek V4 Pro variant for heavy agent loops.

1M context

DeepSeek V3.2

deepseek-v3.2

Balanced DeepSeek reasoning model with a wide context window.

164K context

GLM

GLM models with wide context windows for long agent sessions.

5 models

GLM 5.2

glm-5.2

Latest GLM flagship with a million-token context for long sessions.

1M contextReasoning

GLM 5.1

glm-5.1

Strong GLM reasoning model for complex work with broad context.

203K context

GLM 5

glm-5

Strong GLM model for general coding and agent workflows.

203K context

GLM 4.7

glm-4.7

Balanced GLM model with solid context at lower cost.

203K context

GLM 4.7 Flash

glm-4.7-flash

Fast, low-cost GLM model for high-volume agent tasks.

203K context

Kimi

Kimi models for large-context multimodal coding and research.

4 models

Kimi K2.7 Code

kimi-k2.7-code

Kimi coding specialist with a very large context window.

262K context

Kimi K2.6

kimi-k2.6

Large-context Kimi model for heavier research and coding.

262K context

Kimi K2.5

kimi-k2.5

Balanced Kimi model for general-purpose agent work.

262K context

Kimi K2.5 Lightning

kimi-k2.5-lightning

Premium Kimi lightning variant for fast responses.

131K context

Qwen & More

Qwen and additional specialist models for broad, lower-cost work.

6 models

Mimo V2.5 Pro

mimo-v2.5-pro

Large-context reasoning model for heavier coding and planning.

1M context

Gemma 4 31B IT

gemma-4-31b-it

Fast, lightweight model for everyday coding and assistant work.

262K context

MiniMax M2.5

minimax-m2.5

Low-cost reasoning model for long prompts and agent turns.

205K context

Qwen 3.6 27B

qwen3.6-27b

Fast multimodal Qwen model for broad coding and research.

262K context

Qwen 3.5 397B A17B

qwen3.5-397b-a17b

High-capacity Qwen model for larger multimodal workloads.

262K context

Qwen 3.5 9B

qwen3.5-9b

Small, fast Qwen model for low-cost iteration.

262K context