Pricing Calculator

Estimate your monthly costs with BatchIn and compare against major competitors in real time.

Vendor Overview

Start by vendor family when you already know which ecosystem or license profile fits your product.

Qwen

8

models in this group

Starts at $ 0.000 / 1M input

BatchIn Exclusive

6

models in this group

Starts at $ 0.010 / 1M input

DeepSeek

4

models in this group

Starts at $ 0.080 / 1M input

Z.ai

3

models in this group

Starts at $ 0.150 / 1M input

Baidu

2

models in this group

Starts at $ 0.000 / 1M input

Coming Soon

2

models in this group

Available on request

Best Starting Lanes

Fastest way to choose a route before digging into the full table.

Qwen3.5-4B

Alibaba

llm

Text starts at $ 0.000 / 1M input

FLUX.1-dev

Black Forest Labs

image

Image starts at $ 0.003 / image

Wan2.2-T2V

Wan

video

Video starts at $ 0.15 / clip

Fish-Speech-1.5

Fish Audio

audio

Audio starts at $ 4.00 / 1M bytes

Model Pricing

All prices in USD per 1M tokens · 38 models available

ModelProviderStatusInput / 1MCached / 1MOutput / 1MSiliconFlowTogether AIFireworks AIvs GPT-5.4
Z.ai
GLM-5.1

glm-5.1

SWE-Bench Pro #1
Z.aiAlways On$0.50$0.175$1.50$0.28 / $0.99$0.33 / $1.10$0.39 / $1.3283% cheaper
Z.ai
GLM-5

glm-5

Lower-cost GLM route for production reasoning, agents, and long-context workflows.
Z.aiAlways On$0.35$0.122$0.90$0.25 / $0.90$0.30 / $1.00$0.35 / $1.2088% cheaper
Z.ai
GLM-4.7

glm-4.7

SWE-bench 73.8%
Z.aiAlways On$0.15$0.052$0.80N/AN/AN/A95% cheaper
DeepSeek
DeepSeek R1

deepseek-r1

o1-class reasoning
DeepSeekAlways On$0.18$0.063$0.60$0.25 / $1.00$0.30 / $1.20$0.35 / $1.4094% cheaper
DeepSeek
DeepSeek V3.2

deepseek-v3.2

IMO + IOI gold
DeepSeekAlways On$0.10$0.035$0.15N/AN/AN/A97% cheaper
DeepSeek
DeepSeek V3.1 Terminus

deepseek-v3.1-terminus

Higher-output DeepSeek route for workflows that need longer structured completions.
DeepSeekAlways On$0.10$0.035$0.35N/AN/AN/A97% cheaper
DeepSeek
DeepSeek V3

deepseek-v3

Stable general-purpose DeepSeek route for large-scale chat and batch workloads.
DeepSeekAlways On$0.08$0.028$0.28N/AN/AN/A97% cheaper
A
Qwen3-32B

qwen3-32b

Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.
AlibabaAlways On$0.02$0.007$0.08$0.04 / $0.12$0.05 / $0.15$0.05 / $0.1599% cheaper
A
Qwen3.5-397B-A17B

qwen3.5-397b

201 languages
AlibabaAlways On$0.10$0.035$0.05$0.08 / $0.32$0.10 / $0.40$0.12 / $0.4897% cheaper
A
Qwen3.5-122B-A10B

qwen3.5-122b

Balanced Qwen MoE for long-context assistants and cost-conscious production routing.
AlibabaAlways On$0.08$0.028$0.55N/AN/AN/A97% cheaper
A
Qwen3.5-35B-A3B

qwen3.5-35b

Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.
AlibabaAlways On$0.06$0.021$0.45N/AN/AN/A98% cheaper
A
Qwen3.5-27B

qwen3.5-27b

exceeds GPT-5-mini
AlibabaAlways On$0.07$0.025$0.50N/AN/AN/A98% cheaper
A
Qwen3.5-4B

qwen3.5-4b

FREE tier
AlibabaAlways On$0.00Free tier model.$0.00N/AN/AN/A100% cheaper
A
Qwen3-VL-32B

qwen3-vl-32b

Vision-capable Qwen model for document understanding, multimodal chat, and image-grounded workflows.
AlibabaAlways On$0.00Public BatchIn launch pricing pending.$0.00N/AN/AN/A100% cheaper
MiniMax
MiniMax M2.5

minimax-m2.5

SWE-Bench 80.2%
MiniMaxAlways On$0.10$0.035$0.40$0.04 / $0.12$0.05 / $0.15$0.06 / $0.1897% cheaper
MS
Kimi K2.5

kimi-k2.5

multimodal agent
MoonshotAlways On$0.10$0.035$1.00$0.12 / $0.80$0.15 / $1.00$0.20 / $1.2097% cheaper
OO
GPT-OSS-120B

gpt-oss-120b

OpenAI open-weight MoE with pragmatic pricing for general chat, agents, and product workflows.
OpenAI OSSAlways On$0.02$0.007$0.15N/AN/AN/A99% cheaper
OO
GPT-OSS-20B

gpt-oss-20b

Compact OpenAI open-weight option for fast chat, routing, and lower-cost product features.
OpenAI OSSAlways On$0.01$0.004$0.06N/AN/AN/A100% cheaper
BD
ERNIE 4.5-300B

ernie-4.5-300b

Baidu flagship route for broad Chinese and bilingual enterprise workloads.
BaiduAlways On$0.10$0.035$0.38N/AN/AN/A97% cheaper
TC
Hunyuan-A13B

hunyuan-a13b

Compact Tencent route for low-cost Chinese chat and product assistant scenarios.
TencentAlways On$0.05$0.018$0.20N/AN/AN/A98% cheaper
BD
PaddleOCR-VL-1.5

paddleocr-vl-1.5

FREE
BaiduAlways On$0.00Free model.$0.00N/AN/AN/A100% cheaper
S
Step-3.5-Flash

step-3.5-flash

BatchIn Exclusive
StepFunAlways On$0.01$0.004$0.04N/AN/AN/A100% cheaper
X
MiMo-V2-Flash

mimo-v2-flash

BatchIn Exclusive
XiaomiAlways On$0.01$0.004$0.05N/AN/AN/A100% cheaper
M
Devstral 2

devstral-2

73%+ SWE-bench
MistralAlways On$0.06$0.021$0.25N/AN/AN/A98% cheaper
NVIDIA
Nemotron 3 Super

nemotron-3-super

1M context
NVIDIAAlways On$0.04$0.014$0.15N/AN/AN/A99% cheaper
Meta
Llama 4 Maverick

llama-4-maverick

1M context
MetaAlways On$0.06$0.021$0.20N/AN/AN/A98% cheaper
A
Qwen3.5-9B

qwen3.5-9b

Compact long-context Qwen option for cost-sensitive API traffic and routing layers.
AlibabaAlways On$0.05$0.018$0.40N/AN/AN/A98% cheaper
B
BGE-M3

bge-m3

Multilingual embedding model for search, retrieval, and RAG ranking pipelines.
BAAIAlways On$0.01$0.003$0.00N/AN/AN/A100% cheaper
G
Gemma 4 27B

gemma-4-27b

NEW
GoogleAlways On$0.00Private preview pricing on request.$0.00N/AN/AN/A100% cheaper
DeepSeek
DeepSeek V4

deepseek-v4

Coming Soon
DeepSeekWarm$0.00Not public yet.$0.00N/AN/AN/A100% cheaper
DeepSeek
DeepSeek V4 Lite

deepseek-v4-lite

Coming Soon
DeepSeekWarm$0.00Not public yet.$0.00N/AN/AN/A100% cheaper

Qwen

8 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
Qwen
Qwen3-32B
qwen3-32b
256K$0.02$0.007$0.08$0.01$0.04N/AN/A
Qwen
Qwen3-VL-32B
qwen3-vl-32b
262KN/AN/AN/AN/AN/AN/AN/A
Qwen
Qwen3.5-122B-A10B
qwen3.5-122b
256K$0.08$0.028$0.55$0.04$0.28N/AN/A
Qwen
Qwen3.5-27B
qwen3.5-27b
exceeds GPT-5-mini
256K$0.07$0.025$0.50$0.04$0.25N/AN/A
Qwen
Qwen3.5-35B-A3B
qwen3.5-35b
256K$0.06$0.021$0.45$0.03$0.23N/AN/A
Qwen
Qwen3.5-397B-A17B
qwen3.5-397b
201 languages
256K$0.10$0.035$0.05$0.05$0.03N/AN/A
Qwen
Qwen3.5-4B
qwen3.5-4b
FREE tierFree
256K$0.00N/A$0.00$0.00$0.00N/AN/A
Qwen
Qwen3.5-9B
qwen3.5-9b
256K$0.05$0.018$0.40$0.03$0.20N/AN/A

BatchIn Exclusive

6 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
BatchIn
Devstral 2
devstral-2
73%+ SWE-bench
128K$0.06$0.021$0.25$0.03$0.13N/AN/A
BatchIn
Gemma 4 27B
gemma-4-27b
NEW
128KN/AN/AN/AN/AN/AN/AN/A
BatchIn
Llama 4 Maverick
llama-4-maverick
1M context
1M$0.06$0.021$0.20$0.03$0.10N/AN/A
BatchIn
MiMo-V2-Flash
mimo-v2-flash
BatchIn Exclusive
128K$0.01$0.004$0.05$0.01$0.03N/AN/A
BatchIn
Nemotron 3 Super
nemotron-3-super
1M context
1M$0.04$0.014$0.15$0.02$0.07N/AN/A
BatchIn
Step-3.5-Flash
step-3.5-flash
BatchIn Exclusive
128K$0.01$0.004$0.04$0.01$0.02N/AN/A

DeepSeek

4 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
DeepSeek
DeepSeek R1
deepseek-r1
o1-class reasoning
160K$0.18$0.063$0.60$0.09$0.30$0.50 / $2.1864%
DeepSeek
DeepSeek V3
deepseek-v3
160K$0.08$0.028$0.28$0.04$0.14$0.27 / $1.0070%
DeepSeek
DeepSeek V3.1 Terminus
deepseek-v3.1-terminus
160K$0.10$0.035$0.35$0.05$0.17$0.27 / $1.0063%
DeepSeek
DeepSeek V3.2
deepseek-v3.2
IMO + IOI gold
160K$0.10$0.035$0.15$0.05$0.07$0.27 / $0.4263%

Z.ai

3 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
Z.ai
GLM-4.7
glm-4.7
SWE-bench 73.8%
198K$0.15$0.052$0.80$0.07$0.40$0.42 / $2.2064%
Z.ai
GLM-5
glm-5
198K$0.35$0.122$0.90$0.17$0.45$0.95 / $2.5563%
Z.ai
GLM-5.1
glm-5.1
SWE-Bench Pro #1
198K$0.50$0.175$1.50$0.25$0.75$1.40 / $4.4064%

Baidu

2 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
BD
ERNIE 4.5-300B
ernie-4.5-300b
131K$0.10$0.035$0.38$0.05$0.19N/AN/A
BD
PaddleOCR-VL-1.5
paddleocr-vl-1.5
FREEFree
Document OCR$0.00N/A$0.00$0.00$0.00N/AN/A

Coming Soon

2 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
CS
DeepSeek V4
deepseek-v4
Coming Soon
1MN/AN/AN/AN/AN/AN/AN/A
CS
DeepSeek V4 Lite
deepseek-v4-lite
Coming Soon
TBDN/AN/AN/AN/AN/AN/AN/A

OpenAI

2 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
OA
GPT-OSS-120B
gpt-oss-120b
131K$0.02$0.007$0.15$0.01$0.07$0.05 / $0.4560%
OA
GPT-OSS-20B
gpt-oss-20b
131K$0.01$0.004$0.06$0.01$0.03$0.04 / $0.1875%

Embedding

1 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
Embedding
BGE-M3
bge-m3
8KN/A$0.003N/A$0.00N/AN/AN/A

MiniMax

1 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
MiniMax
MiniMax M2.5
minimax-m2.5
SWE-Bench 80.2%
192K$0.10$0.035$0.40$0.05$0.20$0.30 / $1.2067%

Moonshot

1 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
MS
Kimi K2.5
kimi-k2.5
multimodal agent
256K$0.10$0.035$1.00$0.05$0.50$0.23 / $3.0057%

Tencent

1 models in this group

Expand pricing
ModelContextInput / 1MCached / 1MOutput / 1MBatch InputBatch OutputSiliconFlowSavings
TC
Hunyuan-A13B
hunyuan-a13b
131K$0.05$0.018$0.20$0.03$0.10N/AN/A

Image Generation

Public image models with direct BatchIn vs SiliconFlow price visibility.

Black Forest Labs

FLUX.1-dev

Open FLUX route for fast image generation, previews, and creative asset pipelines.

BatchIn$0.003
SiliconFlow$0.006

Video Generation

Short-form video routes priced for product teams, not just demo clips.

Wan

Wan2.2-T2V

Production-friendly text-to-video model for short cinematic clips and promo content.

BatchIn$0.15
SiliconFlow$0.29

Wan

Wan2.2-I2V

Image-to-video route for product shots, creative transforms, and hero animation loops.

BatchIn$0.15
SiliconFlow$0.29

Audio Models

Speech and TTS lanes sized for narration, assistants, and voice UX.

CosyVoice

CosyVoice2-0.5B

Low-latency speech synthesis for narration, assistants, support voice, and content dubbing.

BatchIn$5.00
SiliconFlow$15.00

Fish Audio

Fish-Speech-1.5

Open speech route for expressive synthesis, cloned voice styles, and content narration.

BatchIn$4.00
SiliconFlow$12.00

IndexTTS

IndexTTS-2

Fast speech stack for product voice output, IVR systems, and developer voice UX.

BatchIn$5.00
SiliconFlow$15.00

Alibaba

Qwen3-TTS

Alibaba’s lightweight TTS route for voice UX and product narration tasks.

BatchIn$5.00
SiliconFlow$15.00

Configure

1K500K
1K500K
10100K

BatchIn (GLM-5.1)

$3,750.00/month

$125.00/day · 30,000 requests/month

OpenAI GPT-5.4 (comparison)

$27,000.00/month

You Save

$23,250.0086% off

per month compared to OpenAI GPT-5.4

SiliconFlow

$2,310.00

Estimated monthly cost for current slider settings

BatchIn saves ~-58%

Together AI

$2,640.00

Estimated monthly cost for current slider settings

BatchIn saves ~-40%

Fireworks AI

$3,135.00

Estimated monthly cost for current slider settings

BatchIn saves ~-17%

Platform Capability Matrix

Module-by-module comparison based on configured competitor capability metadata.

CapabilityBatchInSiliconFlowTogether AIFireworks AI
Batch jobsYesYesYesYes
Verifiable Audit (VaaS)YesNoNoNo
USDC top-upYesNoNoNo
Payment methodsStripe / Alipay / USDCAlipay, WeChat Pay, Credit CardStripe (Credit Card)Stripe (Credit Card)
Get Started

No minimum commitment. Pay only for what you use.