All models. One API.
Text, coding, image, video, audio, embedding & rerank — one OpenAI-compatible API.
DeepSeek V4 Pro
DeepSeek's next-generation flagship. 1M context, native reasoning mode, strong agentic coding and STEM.
DeepSeek V3.2
Frontier-tier general intelligence with strong reasoning, coding, and multilingual coverage.
DeepSeek V4 Flash
DeepSeek V4's fast variant. Same 1M context and reasoning mode, optimised for high concurrency and low latency.
Claude Sonnet 4.6
Anthropic's workhorse coding and chat model. Best Arabic among Western flagships, strong tool use, 200K context.
Qwen3.5-397B-A17B
Alibaba's flagship MoE — 397B total / 17B active. Frontier general intelligence with the best Arabic in the catalog.
GPT-5.4
OpenAI's flagship. Frontier reasoning, vision input, 400K context. Tier 1 pricing up to 272K.
Claude Opus 4.7
Anthropic's frontier model. Highest capability for complex reasoning, long-form, and agentic tasks.
Grok 4
xAI's Grok 4. Real-time-aware reasoning model with strong coding and math.
Llama 3.3 70B Instruct
Meta's Llama 3.3 70B Instruct. Open-weight foundation, widely fine-tuned, reliable baseline.
DeepSeek R1 0528
Open chain-of-thought reasoning. Competitive on math, coding, and logic benchmarks against frontier reasoning models.
Kimi K2.5
Kimi K2.5 — Moonshot AI flagship. 200K context, strong agentic and tool-use performance.
DeepSeek V3.1 Terminus
V3.1 with the Terminus refresh — improved coding, longer effective context, lower hallucination rate.
Gemma 4 31B IT
Google text generation model. Available via THALAM.
GLM 4.6V
Zhipu's flagship vision-language model. Strong document, OCR, and chart understanding.
GLM-5
GLM-5 — the prior generation. Cheaper than 5.1, still very capable for general use.
GLM-5.1
Zhipu's flagship GLM 5.1 — long context, strong multilingual coverage, reasonable pricing.
GPT-5 mini
GPT-5 mini — smaller, faster sibling. Capable and well-suited to high-volume production tasks.
GPT-OSS 120B
OpenAI's open-weight 120B coding model. Apache-licensed, fully self-hostable.
Kimi K2 Thinking
MoonshotAI text generation model. Available via THALAM.
Llama 4 Maverick
Llama 4 Maverick — Meta MoE flagship. Open-weight, 128 experts, FP8 quantized for speed.
Llama 4 Scout
Llama 4 Scout — efficient sibling. 16 experts, lower cost, still very capable.
MiniMax M2.5
M2.5 — the slightly older MiniMax. Same pricing as M2.7, kept for backward-compat.
MiniMax M2.7
MiniMax M2.7 — fast multilingual chat. Strong Asian languages, decent English, cheap.
Mistral Nemo
Cheapest model in the catalog. Mistral-Nemo for budget pipelines.
Qwen3 Max
Qwen3 Max — long context, strong reasoning, tiered pricing for high-volume use.
Qwen3.5-122B-A10B
Mid-tier Qwen3.5 MoE. The price/performance sweet spot for production workloads.
Qwen3 Coder 30B A3B
IDE-tier coding model. Cheap, fast, fits inline-completion workloads.
Qwen3 Coder 480B A35B
Qwen3 Coder 480B — Alibaba's flagship coding MoE. 256K context, 100+ languages, fill-in-the-middle.
Qwen3 Coder Next
Alibaba coding model. Available via THALAM.
FLUX 2 Pro
FLUX 2 Pro — frontier text-to-image. State-of-the-art aesthetics, prompt fidelity, fine detail.
FLUX.1 Kontext Max
FLUX.1 Kontext Max — image edit and conditional generation with text prompts.
GLM Image
GLM Image — Zhipu’s general image model. Solid quality at low cost.
Hunyuan Image 3
Tencent's Hunyuan Image 3 — strong photorealism, multi-aspect ratio support.
Seedream 4.0
Seedream 4.0 — best image deal in the catalog.
Seedream 4.5
Seedream 4.5 — refresh of 4.0, same anchor pricing, slightly better fidelity.
Seedream 5.0 Lite
Seedream 5.0 Lite — newest generation, smaller checkpoint, fast.
Z Image Turbo
Z Image Turbo — cheapest image model, $0.005/image price anchor.
Hunyuan Video Fast
Hunyuan Video Fast — Tencent’s speed-optimised T2V at $0.30/video.
Kling V2.6 Pro Motion
Kuaishou video generation model.
Kling V3.0 Pro I2V
Kling V3.0 Pro — image-to-video. Animate stills with the same engine as T2V.
Kling V3.0 Pro T2V
Kling V3.0 Pro — Kuaishou's flagship text-to-video. State-of-the-art motion fidelity.
Kling-o1 Edit Video
Kuaishou video generation model.
MiniMax Hailuo 02
MiniMax Hailuo 02 — fast, affordable text-to-video.
PixVerse V4.5 T2V
PixVerse video generation model.
Seedance 1.5 Pro I2V
ByteDance video generation model.
Seedance 1.5 Pro T2V
ByteDance video generation model.
Vidu Q3 Pro T2V
Shengshu video generation model.
Wan 2.5 T2V Preview
Wan 2.5 T2V Preview — best video deal in the catalog.
Wan 2.6 T2V
Wan 2.6 T2V — Alibaba’s text-to-video. Strong on cinematic shots and long takes.
ElevenLabs v3
ElevenLabs v3 — gold-standard voice synthesis.
Fish Audio TTS
Fish Audio TTS — multilingual voice synthesis with cloning.
MiniMax 2.8 HD Async
MiniMax audio model (TTS / STT).
MiniMax 2.8 Turbo
MiniMax 2.8 Turbo — latency-optimized variant for real-time TTS use cases.
MiniMax Speech 2.8 HD
MiniMax Speech 2.8 HD — top-tier multilingual TTS with natural prosody.
Llama 3.1 8B Instruct
Meta's compact 8B Instruct — strong baseline for high-throughput backend chat and agent loops at near-zero cost.
Qwen MT Plus
Alibaba's dedicated machine-translation model. Tuned for accuracy on Arabic↔English and 90+ language pairs.
Qwen3 VL 235B A22B Instruct
Flagship vision-language model from Alibaba. 235B MoE with active-22B routing, 131K context, strong image understanding.
Qwen2.5 VL 72B Instruct
Mature 72B dense vision-language model. Baseline VLM tier — different price-quality point from the 235B flagship.
BAAI BGE-M3
Industry-standard multilingual embedding. Dense, sparse, and multi-vector retrieval in one model. The RAG default.
Qwen3 Embedding 0.6B
Tiny 0.6B Qwen3 embedding — designed for high-throughput RAG with a 32K window. Arabic-strong alternative to BGE.
BGE Reranker v2-M3
Industry-standard multilingual reranker. Drop in as the second stage of any RAG pipeline behind a vector recall step.
Qwen Image Edit
Prompt-driven image editor from Alibaba. Single-pass edits like 'add sunglasses', 'change background to beach', 'remove the person on the left'.
FLUX.1 Kontext Pro
Mid-tier FLUX.1 Kontext — image edit and text-guided generation. Slots between Schnell (fast/cheap) and Kontext Max (premium).
Kling V3.0 4K T2V
Kling 3.0 in 4K — text-to-video at premium resolution. Different SKU from the standard-resolution V3.0 Pro.
Kling V3.0 4K I2V
Kling 3.0 in 4K — image-to-video. Animate a still in premium resolution for hero / campaign output.
MiniMax Hailuo 2.3 T2V
Current-generation MiniMax Hailuo — text-to-video. Sharper motion and longer coherence than the Hailuo 02 predecessor.
MiniMax Hailuo 2.3 I2V
Image-to-video variant of Hailuo 2.3. MiniMax's first I2V model — animate stills with prompt-controlled motion.
Wan 2.7 T2V
Latest-gen Wan — text-to-video. Per-second pricing at $0.10/s makes it the cost anchor for long-form video.
Wan 2.7 I2V
Latest-gen Wan — image-to-video. Same $0.10/s pricing as T2V — the budget default for animating stills.
GLM ASR
Zhipu's speech-to-text — multilingual ASR for transcription, meeting capture, and voice-input flows.
MiniMax Voice Cloning
Clone any voice from a 30-second reference sample. The premium tier for voice replication — MiniMax leads the quality bar in this category.
MiniMax Voice Design
Design a custom voice from a text description — 'middle-aged Arabic male, warm tone, slight raspy edge'. Companion to Voice Cloning.
Fish Audio Voice Clone
Cheap voice cloning at $0.10/voice — 24× cheaper than MiniMax. Trade quality for cost; use for higher-volume cases.
MiniMax Music
Text-to-music generation. Describe the song you want — genre, mood, instrumentation, lyrics — and the model composes and renders a track.