Qwen
8
models in this group
Starts at $ 0.000 / 1M input
Estimate your monthly costs with BatchIn and compare against major competitors in real time.
Start by vendor family when you already know which ecosystem or license profile fits your product.
Qwen
8
models in this group
Starts at $ 0.000 / 1M input
BatchIn Exclusive
6
models in this group
Starts at $ 0.010 / 1M input
DeepSeek
4
models in this group
Starts at $ 0.080 / 1M input
Z.ai
3
models in this group
Starts at $ 0.150 / 1M input
Baidu
2
models in this group
Starts at $ 0.000 / 1M input
Coming Soon
2
models in this group
Available on request
Fastest way to choose a route before digging into the full table.
Qwen3.5-4B
Alibaba
Text starts at $ 0.000 / 1M input
FLUX.1-dev
Black Forest Labs
Image starts at $ 0.003 / image
Wan2.2-T2V
Wan
Video starts at $ 0.15 / clip
Fish-Speech-1.5
Fish Audio
Audio starts at $ 4.00 / 1M bytes
All prices in USD per 1M tokens · 38 models available
| Model | Provider | Status | Input / 1M | Cached / 1M | Output / 1M | SiliconFlow | Together AI | Fireworks AI | vs GPT-5.4 |
|---|---|---|---|---|---|---|---|---|---|
GLM-5.1 glm-5.1 | Z.ai | Always On | $0.50 | $0.175 | $1.50 | $0.28 / $0.99 | $0.33 / $1.10 | $0.39 / $1.32 | 83% cheaper |
GLM-5 glm-5 | Z.ai | Always On | $0.35 | $0.122 | $0.90 | $0.25 / $0.90 | $0.30 / $1.00 | $0.35 / $1.20 | 88% cheaper |
GLM-4.7 glm-4.7 | Z.ai | Always On | $0.15 | $0.052 | $0.80 | N/A | N/A | N/A | 95% cheaper |
DeepSeek R1 deepseek-r1 | DeepSeek | Always On | $0.18 | $0.063 | $0.60 | $0.25 / $1.00 | $0.30 / $1.20 | $0.35 / $1.40 | 94% cheaper |
DeepSeek V3.2 deepseek-v3.2 | DeepSeek | Always On | $0.10 | $0.035 | $0.15 | N/A | N/A | N/A | 97% cheaper |
DeepSeek V3.1 Terminus deepseek-v3.1-terminus | DeepSeek | Always On | $0.10 | $0.035 | $0.35 | N/A | N/A | N/A | 97% cheaper |
DeepSeek V3 deepseek-v3 | DeepSeek | Always On | $0.08 | $0.028 | $0.28 | N/A | N/A | N/A | 97% cheaper |
A Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.Qwen3-32B qwen3-32b | Alibaba | Always On | $0.02 | $0.007 | $0.08 | $0.04 / $0.12 | $0.05 / $0.15 | $0.05 / $0.15 | 99% cheaper |
A 201 languagesQwen3.5-397B-A17B qwen3.5-397b | Alibaba | Always On | $0.10 | $0.035 | $0.05 | $0.08 / $0.32 | $0.10 / $0.40 | $0.12 / $0.48 | 97% cheaper |
A Balanced Qwen MoE for long-context assistants and cost-conscious production routing.Qwen3.5-122B-A10B qwen3.5-122b | Alibaba | Always On | $0.08 | $0.028 | $0.55 | N/A | N/A | N/A | 97% cheaper |
A Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.Qwen3.5-35B-A3B qwen3.5-35b | Alibaba | Always On | $0.06 | $0.021 | $0.45 | N/A | N/A | N/A | 98% cheaper |
A exceeds GPT-5-miniQwen3.5-27B qwen3.5-27b | Alibaba | Always On | $0.07 | $0.025 | $0.50 | N/A | N/A | N/A | 98% cheaper |
A FREE tierQwen3.5-4B qwen3.5-4b | Alibaba | Always On | $0.00 | Free tier model. | $0.00 | N/A | N/A | N/A | 100% cheaper |
A Vision-capable Qwen model for document understanding, multimodal chat, and image-grounded workflows.Qwen3-VL-32B qwen3-vl-32b | Alibaba | Always On | $0.00 | Public BatchIn launch pricing pending. | $0.00 | N/A | N/A | N/A | 100% cheaper |
MiniMax M2.5 minimax-m2.5 | MiniMax | Always On | $0.10 | $0.035 | $0.40 | $0.04 / $0.12 | $0.05 / $0.15 | $0.06 / $0.18 | 97% cheaper |
MS multimodal agentKimi K2.5 kimi-k2.5 | Moonshot | Always On | $0.10 | $0.035 | $1.00 | $0.12 / $0.80 | $0.15 / $1.00 | $0.20 / $1.20 | 97% cheaper |
OO OpenAI open-weight MoE with pragmatic pricing for general chat, agents, and product workflows.GPT-OSS-120B gpt-oss-120b | OpenAI OSS | Always On | $0.02 | $0.007 | $0.15 | N/A | N/A | N/A | 99% cheaper |
OO Compact OpenAI open-weight option for fast chat, routing, and lower-cost product features.GPT-OSS-20B gpt-oss-20b | OpenAI OSS | Always On | $0.01 | $0.004 | $0.06 | N/A | N/A | N/A | 100% cheaper |
BD Baidu flagship route for broad Chinese and bilingual enterprise workloads.ERNIE 4.5-300B ernie-4.5-300b | Baidu | Always On | $0.10 | $0.035 | $0.38 | N/A | N/A | N/A | 97% cheaper |
TC Compact Tencent route for low-cost Chinese chat and product assistant scenarios.Hunyuan-A13B hunyuan-a13b | Tencent | Always On | $0.05 | $0.018 | $0.20 | N/A | N/A | N/A | 98% cheaper |
BD FREEPaddleOCR-VL-1.5 paddleocr-vl-1.5 | Baidu | Always On | $0.00 | Free model. | $0.00 | N/A | N/A | N/A | 100% cheaper |
S BatchIn ExclusiveStep-3.5-Flash step-3.5-flash | StepFun | Always On | $0.01 | $0.004 | $0.04 | N/A | N/A | N/A | 100% cheaper |
X BatchIn ExclusiveMiMo-V2-Flash mimo-v2-flash | Xiaomi | Always On | $0.01 | $0.004 | $0.05 | N/A | N/A | N/A | 100% cheaper |
M 73%+ SWE-benchDevstral 2 devstral-2 | Mistral | Always On | $0.06 | $0.021 | $0.25 | N/A | N/A | N/A | 98% cheaper |
Nemotron 3 Super nemotron-3-super | NVIDIA | Always On | $0.04 | $0.014 | $0.15 | N/A | N/A | N/A | 99% cheaper |
Llama 4 Maverick llama-4-maverick | Meta | Always On | $0.06 | $0.021 | $0.20 | N/A | N/A | N/A | 98% cheaper |
A Compact long-context Qwen option for cost-sensitive API traffic and routing layers.Qwen3.5-9B qwen3.5-9b | Alibaba | Always On | $0.05 | $0.018 | $0.40 | N/A | N/A | N/A | 98% cheaper |
B Multilingual embedding model for search, retrieval, and RAG ranking pipelines.BGE-M3 bge-m3 | BAAI | Always On | $0.01 | $0.003 | $0.00 | N/A | N/A | N/A | 100% cheaper |
G NEWGemma 4 27B gemma-4-27b | Always On | $0.00 | Private preview pricing on request. | $0.00 | N/A | N/A | N/A | 100% cheaper | |
DeepSeek V4 deepseek-v4 | DeepSeek | Warm | $0.00 | Not public yet. | $0.00 | N/A | N/A | N/A | 100% cheaper |
DeepSeek V4 Lite deepseek-v4-lite | DeepSeek | Warm | $0.00 | Not public yet. | $0.00 | N/A | N/A | N/A | 100% cheaper |
8 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
Qwen3-32B qwen3-32b | 256K | $0.02 | $0.007 | $0.08 | $0.01 | $0.04 | N/A | N/A |
Qwen3-VL-32B qwen3-vl-32b | 262K | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
Qwen3.5-122B-A10B qwen3.5-122b | 256K | $0.08 | $0.028 | $0.55 | $0.04 | $0.28 | N/A | N/A |
Qwen3.5-27B qwen3.5-27b exceeds GPT-5-mini | 256K | $0.07 | $0.025 | $0.50 | $0.04 | $0.25 | N/A | N/A |
Qwen3.5-35B-A3B qwen3.5-35b | 256K | $0.06 | $0.021 | $0.45 | $0.03 | $0.23 | N/A | N/A |
Qwen3.5-397B-A17B qwen3.5-397b 201 languages | 256K | $0.10 | $0.035 | $0.05 | $0.05 | $0.03 | N/A | N/A |
Qwen3.5-4B qwen3.5-4b FREE tierFree | 256K | $0.00 | N/A | $0.00 | $0.00 | $0.00 | N/A | N/A |
Qwen3.5-9B qwen3.5-9b | 256K | $0.05 | $0.018 | $0.40 | $0.03 | $0.20 | N/A | N/A |
6 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
Devstral 2 devstral-2 73%+ SWE-bench | 128K | $0.06 | $0.021 | $0.25 | $0.03 | $0.13 | N/A | N/A |
Gemma 4 27B gemma-4-27b NEW | 128K | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
Llama 4 Maverick llama-4-maverick 1M context | 1M | $0.06 | $0.021 | $0.20 | $0.03 | $0.10 | N/A | N/A |
MiMo-V2-Flash mimo-v2-flash BatchIn Exclusive | 128K | $0.01 | $0.004 | $0.05 | $0.01 | $0.03 | N/A | N/A |
Nemotron 3 Super nemotron-3-super 1M context | 1M | $0.04 | $0.014 | $0.15 | $0.02 | $0.07 | N/A | N/A |
Step-3.5-Flash step-3.5-flash BatchIn Exclusive | 128K | $0.01 | $0.004 | $0.04 | $0.01 | $0.02 | N/A | N/A |
4 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
DeepSeek R1 deepseek-r1 o1-class reasoning | 160K | $0.18 | $0.063 | $0.60 | $0.09 | $0.30 | $0.50 / $2.18 | 64% |
DeepSeek V3 deepseek-v3 | 160K | $0.08 | $0.028 | $0.28 | $0.04 | $0.14 | $0.27 / $1.00 | 70% |
DeepSeek V3.1 Terminus deepseek-v3.1-terminus | 160K | $0.10 | $0.035 | $0.35 | $0.05 | $0.17 | $0.27 / $1.00 | 63% |
DeepSeek V3.2 deepseek-v3.2 IMO + IOI gold | 160K | $0.10 | $0.035 | $0.15 | $0.05 | $0.07 | $0.27 / $0.42 | 63% |
3 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
GLM-4.7 glm-4.7 SWE-bench 73.8% | 198K | $0.15 | $0.052 | $0.80 | $0.07 | $0.40 | $0.42 / $2.20 | 64% |
GLM-5 glm-5 | 198K | $0.35 | $0.122 | $0.90 | $0.17 | $0.45 | $0.95 / $2.55 | 63% |
GLM-5.1 glm-5.1 SWE-Bench Pro #1 | 198K | $0.50 | $0.175 | $1.50 | $0.25 | $0.75 | $1.40 / $4.40 | 64% |
2 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
BD ERNIE 4.5-300B ernie-4.5-300b | 131K | $0.10 | $0.035 | $0.38 | $0.05 | $0.19 | N/A | N/A |
BD PaddleOCR-VL-1.5 paddleocr-vl-1.5 FREEFree | Document OCR | $0.00 | N/A | $0.00 | $0.00 | $0.00 | N/A | N/A |
2 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
CS DeepSeek V4 deepseek-v4 Coming Soon | 1M | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
CS DeepSeek V4 Lite deepseek-v4-lite Coming Soon | TBD | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
2 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
OA GPT-OSS-120B gpt-oss-120b | 131K | $0.02 | $0.007 | $0.15 | $0.01 | $0.07 | $0.05 / $0.45 | 60% |
OA GPT-OSS-20B gpt-oss-20b | 131K | $0.01 | $0.004 | $0.06 | $0.01 | $0.03 | $0.04 / $0.18 | 75% |
1 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
BGE-M3 bge-m3 | 8K | N/A | $0.003 | N/A | $0.00 | N/A | N/A | N/A |
1 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
MiniMax M2.5 minimax-m2.5 SWE-Bench 80.2% | 192K | $0.10 | $0.035 | $0.40 | $0.05 | $0.20 | $0.30 / $1.20 | 67% |
1 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
MS Kimi K2.5 kimi-k2.5 multimodal agent | 256K | $0.10 | $0.035 | $1.00 | $0.05 | $0.50 | $0.23 / $3.00 | 57% |
1 models in this group
| Model | Context | Input / 1M | Cached / 1M | Output / 1M | Batch Input | Batch Output | SiliconFlow | Savings |
|---|---|---|---|---|---|---|---|---|
TC Hunyuan-A13B hunyuan-a13b | 131K | $0.05 | $0.018 | $0.20 | $0.03 | $0.10 | N/A | N/A |
Public image models with direct BatchIn vs SiliconFlow price visibility.
Black Forest Labs
Open FLUX route for fast image generation, previews, and creative asset pipelines.
Short-form video routes priced for product teams, not just demo clips.
Wan
Production-friendly text-to-video model for short cinematic clips and promo content.
Wan
Image-to-video route for product shots, creative transforms, and hero animation loops.
Speech and TTS lanes sized for narration, assistants, and voice UX.
CosyVoice
Low-latency speech synthesis for narration, assistants, support voice, and content dubbing.
Fish Audio
Open speech route for expressive synthesis, cloned voice styles, and content narration.
IndexTTS
Fast speech stack for product voice output, IVR systems, and developer voice UX.
Alibaba
Alibaba’s lightweight TTS route for voice UX and product narration tasks.
$125.00/day · 30,000 requests/month
per month compared to OpenAI GPT-5.4
SiliconFlow
$2,310.00
Estimated monthly cost for current slider settings
BatchIn saves ~-58%
Together AI
$2,640.00
Estimated monthly cost for current slider settings
BatchIn saves ~-40%
Fireworks AI
$3,135.00
Estimated monthly cost for current slider settings
BatchIn saves ~-17%
Module-by-module comparison based on configured competitor capability metadata.
| Capability | BatchIn | SiliconFlow | Together AI | Fireworks AI |
|---|---|---|---|---|
| Batch jobs | Yes | Yes | Yes | Yes |
| Verifiable Audit (VaaS) | Yes | No | No | No |
| USDC top-up | Yes | No | No | No |
| Payment methods | Stripe / Alipay / USDC | Alipay, WeChat Pay, Credit Card | Stripe (Credit Card) | Stripe (Credit Card) |
No minimum commitment. Pay only for what you use.