VaaS — Verifiable Inference™
Verifiable InferenceProof at every token

AI InferenceNo Filters. No Limits

OpenAI-compatible inference for teams that want direct model access, signed audit trails, batch-first economics, and global USDC billing

Models
38
Always-On
36
Cheaper
30-50%
VaaS
Every Call
Hero portrait

Get Started in 3 Steps

OpenAI-compatible API, no code changes needed, up and running in 60 seconds

1

Sign Up & Get API Key

Create an account, copy your API key, and apply an invite code if you have one

batchin-sk-xxxx...
2

Change base_url

Using OpenAI SDK? Just change one line of code

client = OpenAI(
  base_url="https://api.luminapath.tech/v1",
  api_key="YOUR_KEY"
)
3

Start Inferencing

38 models across text, code, image, video, audio, and embeddings with streaming, batch inference, and VaaS audit

glm-5.1deepseek-v3.2qwen3-32bdeepseek-r1kimi-k2.5
Qwen
DeepSeek
MiniMax
FLUX
CosyVoice
bge-m3
Meta
NVIDIA
ONNX
LLM
Z.ai
Developer Trust

Switch to BatchIn in one line

OpenAI-compatible by default. Validate in Playground first, then move repeatable traffic into Batch.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.luminapath.tech/v1",
    api_key="YOUR_BATCHIN_KEY"
)

response = client.chat.completions.create(
    model="glm-5.1",
    messages=[{"role": "user", "content": "Summarize this meeting"}]
)

Model CatalogChat, Image, Video, TTS, Embeddings, and MORE

Affordable, Reliable, and Easy to Adopt for Teams of Any Size.

Batch Pricing AdvantageBatch 50% OFF

Z.ai

Model ID: glm-5.1

SWE-Bench Pro #1 · Open Source

GLM-5.1

Total Context
198K
Max Output
128K
Std Input Price
$0.50 /M
Std Output Price
$1.50 /M
Batch Input Price
$0.25 /M
Batch Output Price
$0.75 /M

Z.ai

Model ID: glm-5

text

GLM-5

Total Context
198K
Max Output
64K
Std Input Price
$0.35 /M
Std Output Price
$0.90 /M
Batch Input Price
$0.17 /M
Batch Output Price
$0.45 /M

BAAI

Model ID: bge-m3

embedding

BGE-M3

Total Context
8K
Max Output
Vector
Std Input Price
$0.01 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

CosyVoice

Model ID: cosyvoice2-0.5b

audio

CosyVoice2-0.5B

Total Context
Voice synthesis
Max Output
Audio
Std Input Price
$5.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$5.00 /M
Batch Output Price
$0.00 /M

DeepSeek

Model ID: deepseek-r1

text

DeepSeek R1

Total Context
160K
Max Output
64K
Std Input Price
$0.18 /M
Std Output Price
$0.60 /M
Batch Input Price
$0.09 /M
Batch Output Price
$0.30 /M

DeepSeek

Model ID: deepseek-v3

text

DeepSeek V3

Total Context
160K
Max Output
64K
Std Input Price
$0.08 /M
Std Output Price
$0.28 /M
Batch Input Price
$0.04 /M
Batch Output Price
$0.14 /M

DeepSeek

Model ID: deepseek-v3.1-terminus

text

DeepSeek V3.1 Terminus

Total Context
160K
Max Output
64K
Std Input Price
$0.10 /M
Std Output Price
$0.35 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.17 /M

DeepSeek

Model ID: deepseek-v3.2

text

DeepSeek V3.2

Total Context
160K
Max Output
64K
Std Input Price
$0.10 /M
Std Output Price
$0.15 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.07 /M

Mistral

Model ID: devstral-2

text

Devstral 2

Total Context
128K
Max Output
32K
Std Input Price
$0.06 /M
Std Output Price
$0.25 /M
Batch Input Price
$0.03 /M
Batch Output Price
$0.13 /M

Baidu

Model ID: ernie-4.5-300b

text

ERNIE 4.5-300B

Total Context
131K
Max Output
32K
Std Input Price
$0.10 /M
Std Output Price
$0.38 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.19 /M

Fish Audio

Model ID: fish-speech-1-5

audio

Fish-Speech-1.5

Total Context
Voice synthesis
Max Output
Audio
Std Input Price
$4.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$4.00 /M
Batch Output Price
$0.00 /M

Black Forest Labs

Model ID: flux-schnell

image

FLUX.1-dev

Total Context
Prompt-based
Max Output
Image
Std Input Price
$0.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

Google

Model ID: gemma-4-27b

vision

Gemma 4 27B

Total Context
128K
Max Output
32K
Std Input Price
$0.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

Z.ai

Model ID: glm-4.7

text

GLM-4.7

Total Context
198K
Max Output
64K
Std Input Price
$0.15 /M
Std Output Price
$0.80 /M
Batch Input Price
$0.07 /M
Batch Output Price
$0.40 /M

OpenAI OSS

Model ID: gpt-oss-120b

text

GPT-OSS-120B

Total Context
131K
Max Output
32K
Std Input Price
$0.02 /M
Std Output Price
$0.15 /M
Batch Input Price
$0.01 /M
Batch Output Price
$0.07 /M

OpenAI OSS

Model ID: gpt-oss-20b

text

GPT-OSS-20B

Total Context
131K
Max Output
16K
Std Input Price
$0.01 /M
Std Output Price
$0.06 /M
Batch Input Price
$0.01 /M
Batch Output Price
$0.03 /M

Tencent

Model ID: hunyuan-a13b

text

Hunyuan-A13B

Total Context
131K
Max Output
16K
Std Input Price
$0.05 /M
Std Output Price
$0.20 /M
Batch Input Price
$0.03 /M
Batch Output Price
$0.10 /M

IndexTTS

Model ID: indextts-2

audio

IndexTTS-2

Total Context
Voice synthesis
Max Output
Audio
Std Input Price
$5.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$5.00 /M
Batch Output Price
$0.00 /M

Moonshot

Model ID: kimi-k2.5

vision

Kimi K2.5

Total Context
256K
Max Output
64K
Std Input Price
$0.10 /M
Std Output Price
$1.00 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.50 /M

Meta

Model ID: llama-4-maverick

text

Llama 4 Maverick

Total Context
1M
Max Output
32K
Std Input Price
$0.06 /M
Std Output Price
$0.20 /M
Batch Input Price
$0.03 /M
Batch Output Price
$0.10 /M

Xiaomi

Model ID: mimo-v2-flash

text

MiMo-V2-Flash

Total Context
128K
Max Output
32K
Std Input Price
$0.01 /M
Std Output Price
$0.05 /M
Batch Input Price
$0.01 /M
Batch Output Price
$0.03 /M

MiniMax

Model ID: minimax-m2.5

vision

MiniMax M2.5

Total Context
192K
Max Output
64K
Std Input Price
$0.10 /M
Std Output Price
$0.40 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.20 /M

NVIDIA

Model ID: nemotron-3-super

text

Nemotron 3 Super

Total Context
1M
Max Output
32K
Std Input Price
$0.04 /M
Std Output Price
$0.15 /M
Batch Input Price
$0.02 /M
Batch Output Price
$0.07 /M

Baidu

Model ID: paddleocr-vl-1.5

vision

PaddleOCR-VL-1.5

Total Context
Document OCR
Max Output
Structured OCR
Std Input Price
$0.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

Alibaba

Model ID: qwen3-32b

text

Qwen3-32B

Total Context
256K
Max Output
32K
Std Input Price
$0.02 /M
Std Output Price
$0.08 /M
Batch Input Price
$0.01 /M
Batch Output Price
$0.04 /M

Alibaba

Model ID: qwen3-tts

audio

Qwen3-TTS

Total Context
Voice synthesis
Max Output
Audio
Std Input Price
$5.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$5.00 /M
Batch Output Price
$0.00 /M

Alibaba

Model ID: qwen3-vl-32b

vision

Qwen3-VL-32B

Total Context
262K
Max Output
32K
Std Input Price
$0.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

Alibaba

Model ID: qwen3.5-122b

text

Qwen3.5-122B-A10B

Total Context
256K
Max Output
32K
Std Input Price
$0.08 /M
Std Output Price
$0.55 /M
Batch Input Price
$0.04 /M
Batch Output Price
$0.28 /M

Alibaba

Model ID: qwen3.5-27b

text

Qwen3.5-27B

Total Context
256K
Max Output
32K
Std Input Price
$0.07 /M
Std Output Price
$0.50 /M
Batch Input Price
$0.04 /M
Batch Output Price
$0.25 /M

Alibaba

Model ID: qwen3.5-35b

text

Qwen3.5-35B-A3B

Total Context
256K
Max Output
32K
Std Input Price
$0.06 /M
Std Output Price
$0.45 /M
Batch Input Price
$0.03 /M
Batch Output Price
$0.23 /M

Alibaba

Model ID: qwen3.5-397b

text

Qwen3.5-397B-A17B

Total Context
256K
Max Output
64K
Std Input Price
$0.10 /M
Std Output Price
$0.05 /M
Batch Input Price
$0.05 /M
Batch Output Price
$0.03 /M

Alibaba

Model ID: qwen3.5-4b

text

Qwen3.5-4B

Total Context
256K
Max Output
16K
Std Input Price
$0.00 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.00 /M
Batch Output Price
$0.00 /M

Alibaba

Model ID: qwen3.5-9b

text

Qwen3.5-9B

Total Context
256K
Max Output
32K
Std Input Price
$0.05 /M
Std Output Price
$0.40 /M
Batch Input Price
$0.03 /M
Batch Output Price
$0.20 /M

StepFun

Model ID: step-3.5-flash

text

Step-3.5-Flash

Total Context
128K
Max Output
32K
Std Input Price
$0.01 /M
Std Output Price
$0.04 /M
Batch Input Price
$0.01 /M
Batch Output Price
$0.02 /M

Wan

Model ID: wan-2.2-i2v

video

Wan2.2-I2V

Total Context
Image + prompt
Max Output
5s clip
Std Input Price
$0.15 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.15 /M
Batch Output Price
$0.00 /M

Wan

Model ID: wan-2.2

video

Wan2.2-T2V

Total Context
Prompt + reference
Max Output
5s clip
Std Input Price
$0.15 /M
Std Output Price
$0.00 /M
Batch Input Price
$0.15 /M
Batch Output Price
$0.00 /M

Pricing Calculator

Estimate your cost by model, usage, and competitor.

BatchIn

$50.00

≈ ¥360.00

SiliconFlow

$63.25

≈ ¥455.40

Savings

$13.25

21%

Monthly Cost Bar Comparison

BatchIn$50.00
SiliconFlow$63.25

Competitor comparison and average savings are based on configured data. Avg savings: 21%

Dedicated GPU Rental

Reserve high-performance GPUs monthly for stable high-load inference and training.

  • Dedicated isolated resources with predictable performance
  • Supports 24/7 long-running jobs and high-throughput batch workloads
  • Integrates with model scheduling and VaaS audit

What You Can Build

Build differentiated products around uncensored inference, batch processing, verifiable AI, multimodal workflows, and direct GPU control.

Uncensored Agents

Build research, red-team, creative, and workflow agents without adding hidden content-filter layers.

Batch Processing

Process millions of documents with 3-tier priority scheduling and a fill path optimized for the lowest-cost offline throughput.

Verifiable Inference™

Proof at every token. Every call creates an Ed25519-signed record with hash-chain linkage that can be verified in the browser.

Multi-modal

Cover text, code, image, video, speech, and embeddings from one platform instead of stitching together multiple backends.

Agent Payments

Design agent-native payment flows around USDC today and x402-style micropayments as the stack matures.

GPU Leasing

Lease dedicated GPU capacity with SSH root access so your team keeps the runtime, model stack, and operating rules.

Contact Us

Keep your focus on creating and building. Leave the rest to us.

Connect ideas, tasks, and execution in one continuous flow.

Contact visual
BatchIn Live
Multimodal generation: support chat, image, voice, and video creation workflows.
AI assistants: suitable for customer support, collaboration, documents, and data scenarios.
Agent deployment: stable, secure, and controllable endpoints for production agents.
Software development: accelerate coding across generation, completion, edits, and understanding.
Knowledge retrieval: connect private knowledge bases with real-time information for better accuracy.
Intelligent search: stronger retrieval, summarization, answering, and recommendation capabilities.