Pricing Calculator

Estimate your monthly costs with BatchIn and compare against major competitors in real time.

Vendor Overview

Start by vendor family when you already know which ecosystem or license profile fits your product.

Qwen

8 models in this group

Starts at $0.000 / 1M input

BatchIn Exclusive

6 models in this group

Starts at $0.010 / 1M input

DeepSeek

4 models in this group

Starts at $0.080 / 1M input

Z.ai

3 models in this group

Starts at $0.150 / 1M input

Baidu

2 models in this group

Starts at $0.000 / 1M input

Coming Soon

2 models in this group

Available on request

Best Starting Lanes

Fastest way to choose a route before digging into the full table.

Qwen3.5-4B

Alibaba

llm

Text starts at $0.000 / 1M input

FLUX.1-dev

Black Forest Labs

image

Image starts at $0.003 / image

Wan2.2-T2V

Wan

video

Video starts at $0.15 / clip

Fish-Speech-1.5

Fish Audio

audio

Audio starts at $4.00 / 1M bytes

Model Pricing

All prices in USD per 1M tokens · 38 models available

Model	ID	Notes	Provider	Status	Input / 1M	Cached / 1M	Output / 1M	SiliconFlow (in / out)	Together AI (in / out)	Fireworks AI (in / out)	vs GPT-5.4
GLM-5.1	glm-5.1	SWE-Bench Pro #1	Z.ai	Always On	$0.50	$0.175	$1.50	$0.28 / $0.99	$0.33 / $1.10	$0.39 / $1.32	83% cheaper
GLM-5	glm-5	Lower-cost GLM route for production reasoning, agents, and long-context workflows.	Z.ai	Always On	$0.35	$0.122	$0.90	$0.25 / $0.90	$0.30 / $1.00	$0.35 / $1.20	88% cheaper
GLM-4.7	glm-4.7	SWE-bench 73.8%	Z.ai	Always On	$0.15	$0.052	$0.80	N/A	N/A	N/A	95% cheaper
DeepSeek R1	deepseek-r1	o1-class reasoning	DeepSeek	Always On	$0.18	$0.063	$0.60	$0.25 / $1.00	$0.30 / $1.20	$0.35 / $1.40	94% cheaper
DeepSeek V3.2	deepseek-v3.2	IMO + IOI gold	DeepSeek	Always On	$0.10	$0.035	$0.15	N/A	N/A	N/A	97% cheaper
DeepSeek V3.1 Terminus	deepseek-v3.1-terminus	Higher-output DeepSeek route for workflows that need longer structured completions.	DeepSeek	Always On	$0.10	$0.035	$0.35	N/A	N/A	N/A	97% cheaper
DeepSeek V3	deepseek-v3	Stable general-purpose DeepSeek route for large-scale chat and batch workloads.	DeepSeek	Always On	$0.08	$0.028	$0.28	N/A	N/A	N/A	97% cheaper
Qwen3-32B	qwen3-32b	Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.	Alibaba	Always On	$0.02	$0.007	$0.08	$0.04 / $0.12	$0.05 / $0.15	$0.05 / $0.15	99% cheaper
Qwen3.5-397B-A17B	qwen3.5-397b	201 languages	Alibaba	Always On	$0.10	$0.035	$0.05	$0.08 / $0.32	$0.10 / $0.40	$0.12 / $0.48	97% cheaper
Qwen3.5-122B-A10B	qwen3.5-122b	Balanced Qwen MoE for long-context assistants and cost-conscious production routing.	Alibaba	Always On	$0.08	$0.028	$0.55	N/A	N/A	N/A	97% cheaper
Qwen3.5-35B-A3B	qwen3.5-35b	Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.	Alibaba	Always On	$0.06	$0.021	$0.45	N/A	N/A	N/A	98% cheaper
Qwen3.5-27B	qwen3.5-27b	exceeds GPT-5-mini	Alibaba	Always On	$0.07	$0.025	$0.50	N/A	N/A	N/A	98% cheaper
Qwen3.5-4B	qwen3.5-4b	FREE tier	Alibaba	Always On	$0.00	Free tier model.	$0.00	N/A	N/A	N/A	100% cheaper
Qwen3-VL-32B	qwen3-vl-32b	Vision-capable Qwen model for document understanding, multimodal chat, and image-grounded workflows.	Alibaba	Always On	N/A	Public BatchIn launch pricing pending.	N/A	N/A	N/A	N/A	N/A
MiniMax M2.5	minimax-m2.5	SWE-Bench 80.2%	MiniMax	Always On	$0.10	$0.035	$0.40	$0.04 / $0.12	$0.05 / $0.15	$0.06 / $0.18	97% cheaper
Kimi K2.5	kimi-k2.5	multimodal agent	Moonshot	Always On	$0.10	$0.035	$1.00	$0.12 / $0.80	$0.15 / $1.00	$0.20 / $1.20	97% cheaper
GPT-OSS-120B	gpt-oss-120b	OpenAI open-weight MoE with pragmatic pricing for general chat, agents, and product workflows.	OpenAI OSS	Always On	$0.02	$0.007	$0.15	N/A	N/A	N/A	99% cheaper
GPT-OSS-20B	gpt-oss-20b	Compact OpenAI open-weight option for fast chat, routing, and lower-cost product features.	OpenAI OSS	Always On	$0.01	$0.004	$0.06	N/A	N/A	N/A	100% cheaper
ERNIE 4.5-300B	ernie-4.5-300b	Baidu flagship route for broad Chinese and bilingual enterprise workloads.	Baidu	Always On	$0.10	$0.035	$0.38	N/A	N/A	N/A	97% cheaper
Hunyuan-A13B	hunyuan-a13b	Compact Tencent route for low-cost Chinese chat and product assistant scenarios.	Tencent	Always On	$0.05	$0.018	$0.20	N/A	N/A	N/A	98% cheaper
PaddleOCR-VL-1.5	paddleocr-vl-1.5	FREE	Baidu	Always On	$0.00	Free model.	$0.00	N/A	N/A	N/A	100% cheaper
Step-3.5-Flash	step-3.5-flash	BatchIn Exclusive	StepFun	Always On	$0.01	$0.004	$0.04	N/A	N/A	N/A	100% cheaper
MiMo-V2-Flash	mimo-v2-flash	BatchIn Exclusive	Xiaomi	Always On	$0.01	$0.004	$0.05	N/A	N/A	N/A	100% cheaper
Devstral 2	devstral-2	73%+ SWE-bench	Mistral	Always On	$0.06	$0.021	$0.25	N/A	N/A	N/A	98% cheaper
Nemotron 3 Super	nemotron-3-super	1M context	NVIDIA	Always On	$0.04	$0.014	$0.15	N/A	N/A	N/A	99% cheaper
Llama 4 Maverick	llama-4-maverick	1M context	Meta	Always On	$0.06	$0.021	$0.20	N/A	N/A	N/A	98% cheaper
Qwen3.5-9B	qwen3.5-9b	Compact long-context Qwen option for cost-sensitive API traffic and routing layers.	Alibaba	Always On	$0.05	$0.018	$0.40	N/A	N/A	N/A	98% cheaper
BGE-M3	bge-m3	Multilingual embedding model for search, retrieval, and RAG ranking pipelines.	BAAI	Always On	$0.01	$0.003	$0.00	N/A	N/A	N/A	100% cheaper
Gemma 4 27B	gemma-4-27b	NEW	Google	Always On	N/A	Private preview pricing on request.	N/A	N/A	N/A	N/A	N/A
DeepSeek V4	deepseek-v4	Coming Soon	DeepSeek	Warm	N/A	Not public yet.	N/A	N/A	N/A	N/A	N/A
DeepSeek V4 Lite	deepseek-v4-lite	Coming Soon	DeepSeek	Warm	N/A	Not public yet.	N/A	N/A	N/A	N/A	N/A

Qwen

8 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
Qwen3-32B	qwen3-32b	-	256K	$0.02	$0.007	$0.08	$0.01	$0.04	N/A	N/A
Qwen3-VL-32B	qwen3-vl-32b	-	262K	N/A	N/A	N/A	N/A	N/A	N/A	N/A
Qwen3.5-122B-A10B	qwen3.5-122b	-	256K	$0.08	$0.028	$0.55	$0.04	$0.28	N/A	N/A
Qwen3.5-27B	qwen3.5-27b	exceeds GPT-5-mini	256K	$0.07	$0.025	$0.50	$0.04	$0.25	N/A	N/A
Qwen3.5-35B-A3B	qwen3.5-35b	-	256K	$0.06	$0.021	$0.45	$0.03	$0.23	N/A	N/A
Qwen3.5-397B-A17B	qwen3.5-397b	201 languages	256K	$0.10	$0.035	$0.05	$0.05	$0.03	N/A	N/A
Qwen3.5-4B	qwen3.5-4b	FREE tier	256K	$0.00	N/A	$0.00	$0.00	$0.00	N/A	N/A
Qwen3.5-9B	qwen3.5-9b	-	256K	$0.05	$0.018	$0.40	$0.03	$0.20	N/A	N/A

BatchIn Exclusive

6 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
Devstral 2	devstral-2	73%+ SWE-bench	128K	$0.06	$0.021	$0.25	$0.03	$0.13	N/A	N/A
Gemma 4 27B	gemma-4-27b	NEW	128K	N/A	N/A	N/A	N/A	N/A	N/A	N/A
Llama 4 Maverick	llama-4-maverick	1M context	1M	$0.06	$0.021	$0.20	$0.03	$0.10	N/A	N/A
MiMo-V2-Flash	mimo-v2-flash	BatchIn Exclusive	128K	$0.01	$0.004	$0.05	$0.01	$0.03	N/A	N/A
Nemotron 3 Super	nemotron-3-super	1M context	1M	$0.04	$0.014	$0.15	$0.02	$0.07	N/A	N/A
Step-3.5-Flash	step-3.5-flash	BatchIn Exclusive	128K	$0.01	$0.004	$0.04	$0.01	$0.02	N/A	N/A

DeepSeek

4 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
DeepSeek R1	deepseek-r1	o1-class reasoning	160K	$0.18	$0.063	$0.60	$0.09	$0.30	$0.50 / $2.18	64%
DeepSeek V3	deepseek-v3	-	160K	$0.08	$0.028	$0.28	$0.04	$0.14	$0.27 / $1.00	70%
DeepSeek V3.1 Terminus	deepseek-v3.1-terminus	-	160K	$0.10	$0.035	$0.35	$0.05	$0.17	$0.27 / $1.00	63%
DeepSeek V3.2	deepseek-v3.2	IMO + IOI gold	160K	$0.10	$0.035	$0.15	$0.05	$0.07	$0.27 / $0.42	63%
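
The page does not say how the Savings column is derived. The values in this table are consistent with comparing input prices only, BatchIn against SiliconFlow. A minimal sketch under that assumption (the helper name `input_savings_pct` is hypothetical, not part of any BatchIn tooling):

```python
def input_savings_pct(batchin_input, siliconflow_input):
    """Percent saved on input tokens versus SiliconFlow, rounded to a whole number.

    Assumption: savings are computed from input prices only (USD per 1M tokens).
    """
    return round((1 - batchin_input / siliconflow_input) * 100)

# (BatchIn input, SiliconFlow input) taken from the DeepSeek table
rows = {
    "deepseek-r1": (0.18, 0.50),
    "deepseek-v3": (0.08, 0.27),
    "deepseek-v3.1-terminus": (0.10, 0.27),
    "deepseek-v3.2": (0.10, 0.27),
}

for model_id, (ours, theirs) in rows.items():
    print(model_id, f"{input_savings_pct(ours, theirs)}%")
```

Output prices are not part of this calculation, so a model's overall savings for an output-heavy workload may differ from the listed figure.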

Z.ai

3 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
GLM-4.7	glm-4.7	SWE-bench 73.8%	198K	$0.15	$0.052	$0.80	$0.07	$0.40	$0.42 / $2.20	64%
GLM-5	glm-5	-	198K	$0.35	$0.122	$0.90	$0.17	$0.45	$0.95 / $2.55	63%
GLM-5.1	glm-5.1	SWE-Bench Pro #1	198K	$0.50	$0.175	$1.50	$0.25	$0.75	$1.40 / $4.40	64%
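
In the rows above, cached input is priced at roughly 35% of standard input, and batch prices at roughly half of the standard rates. The page does not describe how cached tokens are metered, so the sketch below assumes a simple model in which a fixed fraction of input tokens is billed at the cached rate. The function `request_cost` and its `cache_hit_ratio` parameter are hypothetical, for illustration only:

```python
def request_cost(input_tokens, output_tokens, cache_hit_ratio,
                 input_price, cached_price, output_price):
    """Cost in USD of one request; all prices are USD per 1M tokens.

    cache_hit_ratio is the assumed fraction of input tokens billed
    at the cached rate (0.0 means no cache hits).
    """
    cached = input_tokens * cache_hit_ratio
    uncached = input_tokens - cached
    total = (uncached * input_price
             + cached * cached_price
             + output_tokens * output_price)
    return total / 1_000_000

# GLM-5.1 standard prices: $0.50 input, $0.175 cached, $1.50 output
no_cache = request_cost(100_000, 50_000, 0.0, 0.50, 0.175, 1.50)
half_cached = request_cost(100_000, 50_000, 0.5, 0.50, 0.175, 1.50)

# GLM-5.1 batch prices: $0.25 input, $0.75 output. No cached batch rate
# is listed, so the cached price is unused here (cache_hit_ratio = 0).
batch = request_cost(100_000, 50_000, 0.0, 0.25, 0.25, 0.75)

print(round(no_cache, 5), round(half_cached, 5), round(batch, 5))
```

With these inputs, a request of 100K input and 50K output tokens costs about $0.125 at standard rates, about $0.109 with half the input cached, and about $0.0625 in batch mode.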

Baidu

2 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
ERNIE 4.5-300B	ernie-4.5-300b	-	131K	$0.10	$0.035	$0.38	$0.05	$0.19	N/A	N/A
PaddleOCR-VL-1.5	paddleocr-vl-1.5	FREE	Document OCR	$0.00	N/A	$0.00	$0.00	$0.00	N/A	N/A

Coming Soon

2 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
DeepSeek V4	deepseek-v4	Coming Soon	1M	N/A	N/A	N/A	N/A	N/A	N/A	N/A
DeepSeek V4 Lite	deepseek-v4-lite	Coming Soon	TBD	N/A	N/A	N/A	N/A	N/A	N/A	N/A

OpenAI

2 models in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
GPT-OSS-120B	gpt-oss-120b	-	131K	$0.02	$0.007	$0.15	$0.01	$0.07	$0.05 / $0.45	60%
GPT-OSS-20B	gpt-oss-20b	-	131K	$0.01	$0.004	$0.06	$0.01	$0.03	$0.04 / $0.18	75%

Embedding

1 model in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
BGE-M3	bge-m3	-	8K	N/A	$0.003	N/A	$0.00	N/A	N/A	N/A

MiniMax

1 model in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
MiniMax M2.5	minimax-m2.5	SWE-Bench 80.2%	192K	$0.10	$0.035	$0.40	$0.05	$0.20	$0.30 / $1.20	67%

Moonshot

1 model in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
Kimi K2.5	kimi-k2.5	multimodal agent	256K	$0.10	$0.035	$1.00	$0.05	$0.50	$0.23 / $3.00	57%

Tencent

1 model in this group

Model	ID	Notes	Context	Input / 1M	Cached / 1M	Output / 1M	Batch Input	Batch Output	SiliconFlow (in / out)	Savings vs SiliconFlow
Hunyuan-A13B	hunyuan-a13b	-	131K	$0.05	$0.018	$0.20	$0.03	$0.10	N/A	N/A

Image Generation

Public image models with direct BatchIn vs SiliconFlow price visibility.

Black Forest Labs

FLUX.1-dev

Open FLUX route for fast image generation, previews, and creative asset pipelines.

BatchIn: $0.003

SiliconFlow: $0.006

Video Generation

Short-form video routes priced for product teams, not just demo clips.

Wan

Wan2.2-T2V

Production-friendly text-to-video model for short cinematic clips and promo content.

BatchIn: $0.15

SiliconFlow: $0.29

Wan

Wan2.2-I2V

Image-to-video route for product shots, creative transforms, and hero animation loops.

BatchIn: $0.15

SiliconFlow: $0.29

Audio Models

Speech and TTS lanes sized for narration, assistants, and voice UX.

CosyVoice

CosyVoice2-0.5B

Low-latency speech synthesis for narration, assistants, support voice, and content dubbing.

BatchIn: $5.00

SiliconFlow: $15.00

Fish Audio

Fish-Speech-1.5

Open speech route for expressive synthesis, cloned voice styles, and content narration.

BatchIn: $4.00

SiliconFlow: $12.00

IndexTTS

IndexTTS-2

Fast speech stack for product voice output, IVR systems, and developer voice UX.

BatchIn: $5.00

SiliconFlow: $15.00

Alibaba

Qwen3-TTS

Alibaba’s lightweight TTS route for voice UX and product narration tasks.

BatchIn: $5.00

SiliconFlow: $15.00

Configure

Model: GLM-5.1

Avg Input Tokens per Request: 100K (range: 1K to 500K)

Avg Output Tokens per Request: 50K (range: 1K to 500K)

Requests per Day: 1,000 (range: 10 to 100K)

BatchIn (GLM-5.1)

$3,750.00/month

$125.00/day · 30,000 requests/month
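
The estimate above follows from simple token arithmetic. A minimal sketch, assuming a 30-day month (implied by 30,000 requests/month at 1,000 requests/day) and the GLM-5.1 list prices from the Model Pricing table. The function name `monthly_cost` is hypothetical and not part of any BatchIn SDK:

```python
def monthly_cost(input_tokens, output_tokens, requests_per_day,
                 input_price_per_m, output_price_per_m, days=30):
    """Estimate monthly spend in USD.

    Token counts are per request; prices are USD per 1M tokens.
    """
    requests = requests_per_day * days
    input_millions = input_tokens * requests / 1_000_000
    output_millions = output_tokens * requests / 1_000_000
    return (input_millions * input_price_per_m
            + output_millions * output_price_per_m)

# GLM-5.1 on BatchIn: $0.50 input, $1.50 output per 1M tokens
batchin = monthly_cost(100_000, 50_000, 1_000, 0.50, 1.50)
print(f"${batchin:,.2f}/month")      # $3,750.00/month
print(f"${batchin / 30:,.2f}/day")   # $125.00/day
```

The same function with the Together AI prices listed for GLM-5.1 ($0.33 / $1.10) gives $2,640.00, which matches the comparison card for the same slider settings.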

OpenAI GPT-5.4 (comparison)

$27,000.00/month

You Save

$23,250.00 (86% off)

per month compared to OpenAI GPT-5.4
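
The savings figure can be reproduced from the two monthly totals. A short sketch, assuming the percentage is the saving divided by the GPT-5.4 total and rounded to a whole number (the helper `savings` is hypothetical):

```python
def savings(baseline_cost, own_cost):
    """Return (absolute saving in USD, saving as a rounded percent of the baseline)."""
    saved = baseline_cost - own_cost
    return saved, round(saved / baseline_cost * 100)

# Monthly totals from the calculator: GPT-5.4 vs BatchIn (GLM-5.1)
amount, pct = savings(27_000.00, 3_750.00)
print(f"${amount:,.2f} ({pct}% off)")  # $23,250.00 (86% off)
```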

SiliconFlow

$2,310.00

Estimated monthly cost for current slider settings

BatchIn saves about -58% (negative: BatchIn costs more than SiliconFlow for this model)

Together AI

$2,640.00

Estimated monthly cost for current slider settings

BatchIn saves about -40% (negative: BatchIn costs more than Together AI for this model)

Fireworks AI

$3,135.00

Estimated monthly cost for current slider settings

BatchIn saves about -17% (negative: BatchIn costs more than Fireworks AI for this model)
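
The percentages on the three competitor cards are negative, which means BatchIn is the more expensive option for GLM-5.1. The page does not state how they are computed. They appear consistent with comparing the sum of the input and output list prices rather than the slider totals. The sketch below reproduces the Together AI and Fireworks AI values under that assumption (`list_price_savings` is a hypothetical helper):

```python
def list_price_savings(own, competitor):
    """Percent saved versus a competitor, rounded to a whole number.

    Both arguments are (input_price, output_price) tuples in USD per 1M tokens.
    Assumption: the comparison uses input + output list prices.
    A negative result means the competitor is cheaper.
    """
    own_total = sum(own)
    competitor_total = sum(competitor)
    return round((competitor_total - own_total) / competitor_total * 100)

glm_5_1 = (0.50, 1.50)  # BatchIn list price from the Model Pricing table

print(list_price_savings(glm_5_1, (0.33, 1.10)))  # Together AI: -40
print(list_price_savings(glm_5_1, (0.39, 1.32)))  # Fireworks AI: -17
```

With the rounded SiliconFlow prices from the table ($0.28 / $0.99) the same formula gives -57% rather than the -58% shown, so the page presumably works from unrounded prices.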

Platform Capability Matrix

Module-by-module comparison based on configured competitor capability metadata.

Capability	BatchIn	SiliconFlow	Together AI	Fireworks AI
Batch jobs	Yes	Yes	Yes	Yes
Verifiable Audit (VaaS)	Yes	No	No	No
USDC top-up	Yes	No	No	No
Payment methods	Stripe / Alipay / USDC	Alipay, WeChat Pay, Credit Card	Stripe (Credit Card)	Stripe (Credit Card)

Get Started

No minimum commitment. Pay only for what you use.