Model Center

Production-ready AI model catalog

The public BatchIn catalog prioritizes real live models, exclusives, and near-term launches. No filler, no fake cards.

38 models

Z.ai

glm-5.1

GLM-5.1

Live

SWE-Bench Pro #1

Open-source coding flagship built for long-horizon autonomous engineering and deep reasoning.

Release: Apr 07, 2026
Release: Apr 07, 2026
Max Output: 128K
Context: 198K
Pricing: $0.500 / $1.500

MITOpen Sourcereasoningcodingfeatured

TTFT

520ms

Throughput

42 tok/s

View model detail

Z.ai

glm-5

GLM-5

Live

Lower-cost GLM route for production reasoning, agents, and long-context workflows.

Release: 2026
Release: 2026
Max Output: 64K
Context: 198K
Pricing: $0.350 / $0.900

Apache 2.0Open Sourcereasoningworkflow

TTFT

520ms

Throughput

42 tok/s

View model detail

Z.ai

glm-4.7

GLM-4.7

Live

SWE-bench 73.8%

Mid-tier GLM reasoning route for engineering teams that need quality below flagship spend.

Release: 2026
Release: 2026
Max Output: 64K
Context: 198K
Pricing: $0.150 / $0.800

Apache 2.0Open Sourcereasoningcoding

TTFT

420ms

Throughput

58 tok/s

View model detail

DeepSeek

deepseek-r1

DeepSeek R1

Live

o1-class reasoning

Heavy reasoning model for difficult planning, math, research, and multi-step analysis.

Release: 2025
Release: 2025
Max Output: 64K
Context: 160K
Pricing: $0.180 / $0.600

MITOpen Sourcereasoningmathresearch

TTFT

160ms

Throughput

120 tok/s

View model detail

DeepSeek

deepseek-v3.2

DeepSeek V3.2

Live

IMO + IOI gold

Flagship DeepSeek release tuned for strong general reasoning at a very aggressive price point.

Release: 2026
Release: 2026
Max Output: 64K
Context: 160K
Pricing: $0.100 / $0.150

MITOpen Sourcereasoningfeatured

TTFT

160ms

Throughput

120 tok/s

View model detail

DeepSeek

deepseek-v3.1-terminus

DeepSeek V3.1 Terminus

Live

Higher-output DeepSeek route for workflows that need longer structured completions.

Release: 2026
Release: 2026
Max Output: 64K
Context: 160K
Pricing: $0.100 / $0.350

MITOpen Sourcereasoningworkflow

TTFT

160ms

Throughput

120 tok/s

View model detail

DeepSeek

deepseek-v3

DeepSeek V3

Live

Stable general-purpose DeepSeek route for large-scale chat and batch workloads.

Release: 2025
Release: 2025
Max Output: 64K
Context: 160K
Pricing: $0.080 / $0.280

MITOpen Sourcechatbatch

TTFT

160ms

Throughput

120 tok/s

View model detail

Alibaba

qwen3-32b

Qwen3-32B

Live

Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.

Release: 2026
Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.020 / $0.080

Apache 2.0Open Sourceqwengeneral

TTFT

220ms

Throughput

94 tok/s

View model detail

Alibaba

qwen3.5-397b

Qwen3.5-397B-A17B

Live

201 languages

Top-tier Qwen MoE model for multilingual reasoning, coding, and large-context assistants.

Release: 2026
Release: 2026
Max Output: 64K
Context: 256K
Pricing: $0.100 / $0.050

Apache 2.0Open Sourcemultilingualreasoning

TTFT

420ms

Throughput

58 tok/s

View model detail

Alibaba

qwen3.5-122b

Qwen3.5-122B-A10B

Live

Balanced Qwen MoE for long-context assistants and cost-conscious production routing.

Release: 2026
Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.080 / $0.550

Apache 2.0Open Sourceqwenlong-context

TTFT

310ms

Throughput

72 tok/s

View model detail

Alibaba

qwen3.5-35b

Qwen3.5-35B-A3B

Live

Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.

Release: 2026
Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.060 / $0.450

Apache 2.0Open Sourceqwenmoe

TTFT

220ms

Throughput

94 tok/s

View model detail

Alibaba

qwen3.5-27b

Qwen3.5-27B

Live

exceeds GPT-5-mini

Lean Qwen route aimed at lower-cost chat, agent routing, and product copilot features.

Release: 2026
Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.070 / $0.500

Apache 2.0Open Sourceqwenmid-tier

TTFT

220ms

Throughput

94 tok/s

View model detail