Model Center

Production-ready AI model catalog

The public BatchIn catalog prioritizes real live models, exclusives, and near-term launches. No filler, no fake cards.

38 models

Z.ai

Z.ai

glm-5.1

GLM-5.1

Live

SWE-Bench Pro #1

Open-source coding flagship built for long-horizon autonomous engineering and deep reasoning.

Release
Apr 07, 2026
Release
Apr 07, 2026
Max Output
128K
Context
198K
Pricing
$0.500 / $1.500

Cached Input: $0.175

MITOpen Sourcereasoningcodingfeatured

TTFT

520ms

Throughput

42 tok/s

View model detail
Z.ai

Z.ai

glm-5

GLM-5

Live

Lower-cost GLM route for production reasoning, agents, and long-context workflows.

Release
2026
Release
2026
Max Output
64K
Context
198K
Pricing
$0.350 / $0.900

Cached Input: $0.122

Apache 2.0Open Sourcereasoningworkflow

TTFT

520ms

Throughput

42 tok/s

View model detail
Z.ai

Z.ai

glm-4.7

GLM-4.7

Live

SWE-bench 73.8%

Mid-tier GLM reasoning route for engineering teams that need quality below flagship spend.

Release
2026
Release
2026
Max Output
64K
Context
198K
Pricing
$0.150 / $0.800

Cached Input: $0.052

Apache 2.0Open Sourcereasoningcoding

TTFT

420ms

Throughput

58 tok/s

View model detail
DeepSeek

DeepSeek

deepseek-r1

DeepSeek R1

Live

o1-class reasoning

Heavy reasoning model for difficult planning, math, research, and multi-step analysis.

Release
2025
Release
2025
Max Output
64K
Context
160K
Pricing
$0.180 / $0.600

Cached Input: $0.063

MITOpen Sourcereasoningmathresearch

TTFT

160ms

Throughput

120 tok/s

View model detail
DeepSeek

DeepSeek

deepseek-v3.2

DeepSeek V3.2

Live

IMO + IOI gold

Flagship DeepSeek release tuned for strong general reasoning at a very aggressive price point.

Release
2026
Release
2026
Max Output
64K
Context
160K
Pricing
$0.100 / $0.150

Cached Input: $0.035

MITOpen Sourcereasoningfeatured

TTFT

160ms

Throughput

120 tok/s

View model detail
DeepSeek

DeepSeek

deepseek-v3.1-terminus

DeepSeek V3.1 Terminus

Live

Higher-output DeepSeek route for workflows that need longer structured completions.

Release
2026
Release
2026
Max Output
64K
Context
160K
Pricing
$0.100 / $0.350

Cached Input: $0.035

MITOpen Sourcereasoningworkflow

TTFT

160ms

Throughput

120 tok/s

View model detail
DeepSeek

DeepSeek

deepseek-v3

DeepSeek V3

Live

Stable general-purpose DeepSeek route for large-scale chat and batch workloads.

Release
2025
Release
2025
Max Output
64K
Context
160K
Pricing
$0.080 / $0.280

Cached Input: $0.028

MITOpen Sourcechatbatch

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Alibaba

qwen3-32b

Qwen3-32B

Live

Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.

Release
2026
Release
2026
Max Output
32K
Context
256K
Pricing
$0.020 / $0.080

Cached Input: $0.007

Apache 2.0Open Sourceqwengeneral

TTFT

220ms

Throughput

94 tok/s

View model detail
Qwen

Alibaba

qwen3.5-397b

Qwen3.5-397B-A17B

Live

201 languages

Top-tier Qwen MoE model for multilingual reasoning, coding, and large-context assistants.

Release
2026
Release
2026
Max Output
64K
Context
256K
Pricing
$0.100 / $0.050

Cached Input: $0.035

Apache 2.0Open Sourcemultilingualreasoning

TTFT

420ms

Throughput

58 tok/s

View model detail
Qwen

Alibaba

qwen3.5-122b

Qwen3.5-122B-A10B

Live

Balanced Qwen MoE for long-context assistants and cost-conscious production routing.

Release
2026
Release
2026
Max Output
32K
Context
256K
Pricing
$0.080 / $0.550

Cached Input: $0.028

Apache 2.0Open Sourceqwenlong-context

TTFT

310ms

Throughput

72 tok/s

View model detail
Qwen

Alibaba

qwen3.5-35b

Qwen3.5-35B-A3B

Live

Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.

Release
2026
Release
2026
Max Output
32K
Context
256K
Pricing
$0.060 / $0.450

Cached Input: $0.021

Apache 2.0Open Sourceqwenmoe

TTFT

220ms

Throughput

94 tok/s

View model detail
Qwen

Alibaba

qwen3.5-27b

Qwen3.5-27B

Live

exceeds GPT-5-mini

Lean Qwen route aimed at lower-cost chat, agent routing, and product copilot features.

Release
2026
Release
2026
Max Output
32K
Context
256K
Pricing
$0.070 / $0.500

Cached Input: $0.025

Apache 2.0Open Sourceqwenmid-tier

TTFT

220ms

Throughput

94 tok/s

View model detail