
VaaS — Verifiable Inference™
Verifiable Inference: Proof at every token
AI Inference: No Filters. No Limits
OpenAI-compatible inference for teams that want direct model access, signed audit trails, batch-first economics, and global USDC billing
Models: 38
Always-On: 36
Cheaper: 30-50%
VaaS: Every Call

Get Started in 3 Steps
OpenAI-compatible API: keep your existing code, change one line, and be up and running in 60 seconds
1
Sign Up & Get API Key
Create an account, copy your API key, and apply an invite code if you have one
batchin-sk-xxxx...
2
Change base_url
Using OpenAI SDK? Just change one line of code
client = OpenAI( base_url="https://api.luminapath.tech/v1", api_key="YOUR_KEY" )
3
Start Inferencing
38 models across text, code, image, video, audio, and embeddings with streaming, batch inference, and VaaS audit
glm-5.1, deepseek-v3.2, qwen3-32b, deepseek-r1, kimi-k2.5
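Because the API is described as OpenAI-compatible, streamed responses presumably arrive as OpenAI-style server-sent events. The sketch below shows one way to assemble text from such a stream using only the standard library. The helper name `iter_stream_text` is hypothetical, and the sample events are hand-written for illustration rather than captured from BatchIn.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style server-sent event lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and non-data fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel in the OpenAI wire format
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


# Hand-written sample events (assumed shape, not real service output).
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Hello"}}]}',
    'data: {"choices": [{"delta": {"content": ", world"}}]}',
    "data: [DONE]",
]
print("".join(iter_stream_text(sample)))
```

In practice the OpenAI SDK handles this parsing when `stream=True` is passed; the sketch only makes the assumed wire format visible.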

Developer Trust
Switch to BatchIn in one line
OpenAI-compatible by default. Validate in Playground first, then move repeatable traffic into Batch.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.luminapath.tech/v1",
    api_key="YOUR_BATCHIN_KEY"
)
response = client.chat.completions.create(
    model="glm-5.1",
    messages=[{"role": "user", "content": "Summarize this meeting"}]
)
Model Catalog
Chat, Image, Video, TTS, Embeddings, and MORE
Affordable, Reliable, and Easy to Adopt for Teams of Any Size.
Batch Pricing Advantage: Batch 50% OFF
Z.ai
Model ID: glm-5.1
GLM-5.1
- Total Context: 198K
- Max Output: 128K
- Std Input Price: $0.50 /M
- Std Output Price: $1.50 /M
- Batch Input Price: $0.25 /M
- Batch Output Price: $0.75 /M
Z.ai
Model ID: glm-5
GLM-5
- Total Context: 198K
- Max Output: 64K
- Std Input Price: $0.35 /M
- Std Output Price: $0.90 /M
- Batch Input Price: $0.17 /M
- Batch Output Price: $0.45 /M
BAAI
Model ID: bge-m3
BGE-M3
- Total Context: 8K
- Max Output: Vector
- Std Input Price: $0.01 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
CosyVoice
Model ID: cosyvoice2-0.5b
CosyVoice2-0.5B
- Total Context: Voice synthesis
- Max Output: Audio
- Std Input Price: $5.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $5.00 /M
- Batch Output Price: $0.00 /M
DeepSeek
Model ID: deepseek-r1
DeepSeek R1
- Total Context: 160K
- Max Output: 64K
- Std Input Price: $0.18 /M
- Std Output Price: $0.60 /M
- Batch Input Price: $0.09 /M
- Batch Output Price: $0.30 /M
DeepSeek
Model ID: deepseek-v3
DeepSeek V3
- Total Context: 160K
- Max Output: 64K
- Std Input Price: $0.08 /M
- Std Output Price: $0.28 /M
- Batch Input Price: $0.04 /M
- Batch Output Price: $0.14 /M
DeepSeek
Model ID: deepseek-v3.1-terminus
DeepSeek V3.1 Terminus
- Total Context: 160K
- Max Output: 64K
- Std Input Price: $0.10 /M
- Std Output Price: $0.35 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.17 /M
DeepSeek
Model ID: deepseek-v3.2
DeepSeek V3.2
- Total Context: 160K
- Max Output: 64K
- Std Input Price: $0.10 /M
- Std Output Price: $0.15 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.07 /M
Mistral
Model ID: devstral-2
Devstral 2
- Total Context: 128K
- Max Output: 32K
- Std Input Price: $0.06 /M
- Std Output Price: $0.25 /M
- Batch Input Price: $0.03 /M
- Batch Output Price: $0.13 /M
Baidu
Model ID: ernie-4.5-300b
ERNIE 4.5-300B
- Total Context: 131K
- Max Output: 32K
- Std Input Price: $0.10 /M
- Std Output Price: $0.38 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.19 /M
Fish Audio
Model ID: fish-speech-1-5
Fish-Speech-1.5
- Total Context: Voice synthesis
- Max Output: Audio
- Std Input Price: $4.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $4.00 /M
- Batch Output Price: $0.00 /M
Black Forest Labs
Model ID: flux-schnell
FLUX.1-dev
- Total Context: Prompt-based
- Max Output: Image
- Std Input Price: $0.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
Model ID: gemma-4-27b
Gemma 4 27B
- Total Context: 128K
- Max Output: 32K
- Std Input Price: $0.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
Z.ai
Model ID: glm-4.7
GLM-4.7
- Total Context: 198K
- Max Output: 64K
- Std Input Price: $0.15 /M
- Std Output Price: $0.80 /M
- Batch Input Price: $0.07 /M
- Batch Output Price: $0.40 /M
OpenAI OSS
Model ID: gpt-oss-120b
GPT-OSS-120B
- Total Context: 131K
- Max Output: 32K
- Std Input Price: $0.02 /M
- Std Output Price: $0.15 /M
- Batch Input Price: $0.01 /M
- Batch Output Price: $0.07 /M
OpenAI OSS
Model ID: gpt-oss-20b
GPT-OSS-20B
- Total Context: 131K
- Max Output: 16K
- Std Input Price: $0.01 /M
- Std Output Price: $0.06 /M
- Batch Input Price: $0.01 /M
- Batch Output Price: $0.03 /M
Tencent
Model ID: hunyuan-a13b
Hunyuan-A13B
- Total Context: 131K
- Max Output: 16K
- Std Input Price: $0.05 /M
- Std Output Price: $0.20 /M
- Batch Input Price: $0.03 /M
- Batch Output Price: $0.10 /M
IndexTTS
Model ID: indextts-2
IndexTTS-2
- Total Context: Voice synthesis
- Max Output: Audio
- Std Input Price: $5.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $5.00 /M
- Batch Output Price: $0.00 /M
Moonshot
Model ID: kimi-k2.5
Kimi K2.5
- Total Context: 256K
- Max Output: 64K
- Std Input Price: $0.10 /M
- Std Output Price: $1.00 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.50 /M
Meta
Model ID: llama-4-maverick
Llama 4 Maverick
- Total Context: 1M
- Max Output: 32K
- Std Input Price: $0.06 /M
- Std Output Price: $0.20 /M
- Batch Input Price: $0.03 /M
- Batch Output Price: $0.10 /M
Xiaomi
Model ID: mimo-v2-flash
MiMo-V2-Flash
- Total Context: 128K
- Max Output: 32K
- Std Input Price: $0.01 /M
- Std Output Price: $0.05 /M
- Batch Input Price: $0.01 /M
- Batch Output Price: $0.03 /M
MiniMax
Model ID: minimax-m2.5
MiniMax M2.5
- Total Context: 192K
- Max Output: 64K
- Std Input Price: $0.10 /M
- Std Output Price: $0.40 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.20 /M
NVIDIA
Model ID: nemotron-3-super
Nemotron 3 Super
- Total Context: 1M
- Max Output: 32K
- Std Input Price: $0.04 /M
- Std Output Price: $0.15 /M
- Batch Input Price: $0.02 /M
- Batch Output Price: $0.07 /M
Baidu
Model ID: paddleocr-vl-1.5
PaddleOCR-VL-1.5
- Total Context: Document OCR
- Max Output: Structured OCR
- Std Input Price: $0.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
Alibaba
Model ID: qwen3-32b
Qwen3-32B
- Total Context: 256K
- Max Output: 32K
- Std Input Price: $0.02 /M
- Std Output Price: $0.08 /M
- Batch Input Price: $0.01 /M
- Batch Output Price: $0.04 /M
Alibaba
Model ID: qwen3-tts
Qwen3-TTS
- Total Context: Voice synthesis
- Max Output: Audio
- Std Input Price: $5.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $5.00 /M
- Batch Output Price: $0.00 /M
Alibaba
Model ID: qwen3-vl-32b
Qwen3-VL-32B
- Total Context: 262K
- Max Output: 32K
- Std Input Price: $0.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
Alibaba
Model ID: qwen3.5-122b
Qwen3.5-122B-A10B
- Total Context: 256K
- Max Output: 32K
- Std Input Price: $0.08 /M
- Std Output Price: $0.55 /M
- Batch Input Price: $0.04 /M
- Batch Output Price: $0.28 /M
Alibaba
Model ID: qwen3.5-27b
Qwen3.5-27B
- Total Context: 256K
- Max Output: 32K
- Std Input Price: $0.07 /M
- Std Output Price: $0.50 /M
- Batch Input Price: $0.04 /M
- Batch Output Price: $0.25 /M
Alibaba
Model ID: qwen3.5-35b
Qwen3.5-35B-A3B
- Total Context: 256K
- Max Output: 32K
- Std Input Price: $0.06 /M
- Std Output Price: $0.45 /M
- Batch Input Price: $0.03 /M
- Batch Output Price: $0.23 /M
Alibaba
Model ID: qwen3.5-397b
Qwen3.5-397B-A17B
- Total Context: 256K
- Max Output: 64K
- Std Input Price: $0.10 /M
- Std Output Price: $0.05 /M
- Batch Input Price: $0.05 /M
- Batch Output Price: $0.03 /M
Alibaba
Model ID: qwen3.5-4b
Qwen3.5-4B
- Total Context: 256K
- Max Output: 16K
- Std Input Price: $0.00 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.00 /M
- Batch Output Price: $0.00 /M
Alibaba
Model ID: qwen3.5-9b
Qwen3.5-9B
- Total Context: 256K
- Max Output: 32K
- Std Input Price: $0.05 /M
- Std Output Price: $0.40 /M
- Batch Input Price: $0.03 /M
- Batch Output Price: $0.20 /M
StepFun
Model ID: step-3.5-flash
Step-3.5-Flash
- Total Context: 128K
- Max Output: 32K
- Std Input Price: $0.01 /M
- Std Output Price: $0.04 /M
- Batch Input Price: $0.01 /M
- Batch Output Price: $0.02 /M
Wan
Model ID: wan-2.2-i2v
Wan2.2-I2V
- Total Context: Image + prompt
- Max Output: 5s clip
- Std Input Price: $0.15 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.15 /M
- Batch Output Price: $0.00 /M
Wan
Model ID: wan-2.2
Wan2.2-T2V
- Total Context: Prompt + reference
- Max Output: 5s clip
- Std Input Price: $0.15 /M
- Std Output Price: $0.00 /M
- Batch Input Price: $0.15 /M
- Batch Output Price: $0.00 /M
Pricing Calculator
Estimate your cost by model, usage, and competitor.
BatchIn: $50.00 (≈ ¥360.00)
SiliconFlow: $63.25 (≈ ¥455.40)
Savings: $13.25 (21%)
Monthly Cost Bar Comparison: BatchIn $50.00, SiliconFlow $63.25
Competitor comparison and average savings are based on configured data. Avg savings: 21%
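The arithmetic behind an estimate like this can be sketched in a few lines. The example below assumes "/M" means US dollars per million tokens and copies two rows from the catalog above; `Price`, `monthly_cost`, and `savings_percent` are hypothetical names for illustration, not part of any BatchIn SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD (assumed meaning of "/M")."""
    std_in: float
    std_out: float
    batch_in: float
    batch_out: float


# Two rows copied from the catalog; extend as needed.
PRICES = {
    "glm-5.1": Price(0.50, 1.50, 0.25, 0.75),
    "deepseek-r1": Price(0.18, 0.60, 0.09, 0.30),
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int,
                 batch: bool = False) -> float:
    """Return the estimated cost in USD for the given token volumes."""
    p = PRICES[model]
    rate_in, rate_out = (p.batch_in, p.batch_out) if batch else (p.std_in, p.std_out)
    return (input_tokens * rate_in + output_tokens * rate_out) / 1_000_000


def savings_percent(ours: float, theirs: float) -> int:
    """Savings relative to the competitor's price, rounded to a whole percent."""
    return round((theirs - ours) / theirs * 100)


# 10M input and 2M output tokens on glm-5.1, standard versus batch.
print(monthly_cost("glm-5.1", 10_000_000, 2_000_000))
print(monthly_cost("glm-5.1", 10_000_000, 2_000_000, batch=True))
# The calculator figures shown above: $50.00 versus $63.25.
print(savings_percent(50.00, 63.25))
```

With the calculator's own figures, the $13.25 difference divided by the $63.25 competitor price gives the 21% shown above.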
Dedicated GPU Rental
Reserve high-performance GPUs monthly for stable high-load inference and training.
- Dedicated isolated resources with predictable performance
- Supports 24/7 long-running jobs and high-throughput batch workloads
- Integrates with model scheduling and VaaS audit
What You Can Build
Build differentiated products around uncensored inference, batch processing, verifiable AI, multimodal workflows, and direct GPU control.
Uncensored Agents
Build research, red-team, creative, and workflow agents without adding hidden content-filter layers.
Batch Processing
Process millions of documents with 3-tier priority scheduling and a fill path optimized for the lowest-cost offline throughput.
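One common way to prepare offline jobs on OpenAI-compatible platforms is a JSONL file with one request per line, the layout used by OpenAI's Batch API. Whether BatchIn accepts this exact layout is an assumption, and the priority tiers mentioned above are not modeled here. `build_batch_jsonl` and the `doc-N` identifiers are hypothetical.

```python
import json
from typing import Iterable


def build_batch_jsonl(model: str, prompts: Iterable[str]) -> str:
    """Serialize one chat-completion request per line (OpenAI Batch layout)."""
    lines = []
    for i, prompt in enumerate(prompts):
        request = {
            "custom_id": f"doc-{i}",        # lets results be matched to inputs
            "method": "POST",
            "url": "/v1/chat/completions",  # assumed endpoint path
            "body": {
                "model": model,
                "messages": [{"role": "user", "content": prompt}],
            },
        }
        lines.append(json.dumps(request, ensure_ascii=False) + "\n")
    return "".join(lines)


payload = build_batch_jsonl(
    "glm-5.1",
    ["Summarize document A", "Summarize document B"],
)
print(payload, end="")
```

Keeping a stable `custom_id` per document makes it possible to join results back to their inputs even if the service returns them out of order.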
Verifiable Inference™
Proof at every token. Every call creates an Ed25519-signed record with hash-chain linkage that can be verified in the browser.
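To make the hash-chain idea concrete, here is a minimal sketch of linking and checking records with SHA-256 from the standard library. The record layout (`prev_hash`, `payload`), the all-zero genesis value, and the function names are assumptions for illustration; BatchIn's actual record format is not documented here. Ed25519 signature checking is omitted because it needs a third-party library.

```python
import hashlib
import json

GENESIS = "0" * 64  # assumed placeholder hash for the first record


def record_hash(record: dict) -> str:
    """Hash a record's canonical JSON form with SHA-256."""
    body = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def append_record(chain: list, payload: dict) -> dict:
    """Append a record that commits to the hash of its predecessor."""
    prev = record_hash(chain[-1]) if chain else GENESIS
    record = {"prev_hash": prev, "payload": payload}
    chain.append(record)
    return record


def verify_chain(chain: list) -> bool:
    """Check that every record points at the hash of the one before it."""
    expected = GENESIS
    for record in chain:
        if record["prev_hash"] != expected:
            return False
        expected = record_hash(record)
    return True


chain: list = []
for call_id in ("call-1", "call-2", "call-3"):
    append_record(chain, {"call": call_id})
print(verify_chain(chain))

# Editing an earlier record breaks the link stored in its successor.
chain[0]["payload"]["call"] = "tampered"
print(verify_chain(chain))
```

Linkage alone cannot detect a change to the final record; that is the role a signature over each record (or over the latest hash) would play.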
Multi-modal
Cover text, code, image, video, speech, and embeddings from one platform instead of stitching together multiple backends.
Agent Payments
Design agent-native payment flows around USDC today and x402-style micropayments as the stack matures.
GPU Leasing
Lease dedicated GPU capacity with SSH root access so your team keeps the runtime, model stack, and operating rules.
Contact Us
Keep your focus on creating and building. Leave the rest to us.
Connect ideas, tasks, and execution in one continuous flow.
BatchIn Live
Multimodal generation: supports chat, image, voice, and video creation workflows.
AI assistants: suitable for customer support, collaboration, documents, and data scenarios.
Agent deployment: stable, secure, and controllable endpoints for production agents.
Software development: accelerate coding across generation, completion, edits, and understanding.
Knowledge retrieval: connect private knowledge bases with real-time information for better accuracy.
Intelligent search: stronger retrieval, summarization, answering, and recommendation capabilities.
Upcoming Events
Join our next hackathon, webinar, or build challenge