VaaS — Verifiable Inference™

Verifiable InferenceProof at every token

AI InferenceNo Filters. No Limits

OpenAI-compatible inference for teams that want direct model access, signed audit trails, batch-first economics, and global USDC billing

Start with invite code

Models

Always-On

Cheaper

30-50%

VaaS

Every Call

Get Started in 3 Steps

OpenAI-compatible API, no code changes needed, up and running in 60 seconds

Sign Up & Get API Key

Create an account, copy your API key, and apply an invite code if you have one

batchin-sk-xxxx...

Change base_url

Using OpenAI SDK? Just change one line of code

client = OpenAI(
  base_url="https://api.luminapath.tech/v1",
  api_key="YOUR_KEY"
)

Start Inferencing

38 models across text, code, image, video, audio, and embeddings with streaming, batch inference, and VaaS audit

glm-5.1deepseek-v3.2qwen3-32bdeepseek-r1kimi-k2.5

Start Free →

Developer Trust

Switch to BatchIn in one line

OpenAI-compatible by default. Validate in Playground first, then move repeatable traffic into Batch.

Open Docs Open Playground

from openai import OpenAI

client = OpenAI(
    base_url="https://api.luminapath.tech/v1",
    api_key="YOUR_BATCHIN_KEY"
)

response = client.chat.completions.create(
    model="glm-5.1",
    messages=[{"role": "user", "content": "Summarize this meeting"}]
)

Model CatalogChat, Image, Video, TTS, Embeddings, and MORE

Affordable, Reliable, and Easy to Adopt for Teams of Any Size.

Batch Pricing AdvantageBatch 50% OFF

Z.ai

Model ID: glm-5.1

SWE-Bench Pro #1 · Open Source

GLM-5.1

Total Context: 198K
Max Output: 128K
Std Input Price: $0.50 /M
Std Output Price: $1.50 /M
Batch Input Price: $0.25 /M
Batch Output Price: $0.75 /M

Z.ai

Model ID: glm-5

text

GLM-5

Total Context: 198K
Max Output: 64K
Std Input Price: $0.35 /M
Std Output Price: $0.90 /M
Batch Input Price: $0.17 /M
Batch Output Price: $0.45 /M

BAAI

Model ID: bge-m3

embedding

BGE-M3

Total Context: 8K
Max Output: Vector
Std Input Price: $0.01 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

CosyVoice

Model ID: cosyvoice2-0.5b

audio

CosyVoice2-0.5B

Total Context: Voice synthesis
Max Output: Audio
Std Input Price: $5.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $5.00 /M
Batch Output Price: $0.00 /M

DeepSeek

Model ID: deepseek-r1

text

DeepSeek R1

Total Context: 160K
Max Output: 64K
Std Input Price: $0.18 /M
Std Output Price: $0.60 /M
Batch Input Price: $0.09 /M
Batch Output Price: $0.30 /M

DeepSeek

Model ID: deepseek-v3

text

DeepSeek V3

Total Context: 160K
Max Output: 64K
Std Input Price: $0.08 /M
Std Output Price: $0.28 /M
Batch Input Price: $0.04 /M
Batch Output Price: $0.14 /M

DeepSeek

Model ID: deepseek-v3.1-terminus

text

DeepSeek V3.1 Terminus

Total Context: 160K
Max Output: 64K
Std Input Price: $0.10 /M
Std Output Price: $0.35 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.17 /M

DeepSeek

Model ID: deepseek-v3.2

text

DeepSeek V3.2

Total Context: 160K
Max Output: 64K
Std Input Price: $0.10 /M
Std Output Price: $0.15 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.07 /M

Mistral

Model ID: devstral-2

text

Devstral 2

Total Context: 128K
Max Output: 32K
Std Input Price: $0.06 /M
Std Output Price: $0.25 /M
Batch Input Price: $0.03 /M
Batch Output Price: $0.13 /M

Baidu

Model ID: ernie-4.5-300b

text

ERNIE 4.5-300B

Total Context: 131K
Max Output: 32K
Std Input Price: $0.10 /M
Std Output Price: $0.38 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.19 /M

Fish Audio

Model ID: fish-speech-1-5

audio

Fish-Speech-1.5

Total Context: Voice synthesis
Max Output: Audio
Std Input Price: $4.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $4.00 /M
Batch Output Price: $0.00 /M

Black Forest Labs

Model ID: flux-schnell

image

FLUX.1-dev

Total Context: Prompt-based
Max Output: Image
Std Input Price: $0.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

Google

Model ID: gemma-4-27b

vision

Gemma 4 27B

Total Context: 128K
Max Output: 32K
Std Input Price: $0.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

Z.ai

Model ID: glm-4.7

text

GLM-4.7

Total Context: 198K
Max Output: 64K
Std Input Price: $0.15 /M
Std Output Price: $0.80 /M
Batch Input Price: $0.07 /M
Batch Output Price: $0.40 /M

OpenAI OSS

Model ID: gpt-oss-120b

text

GPT-OSS-120B

Total Context: 131K
Max Output: 32K
Std Input Price: $0.02 /M
Std Output Price: $0.15 /M
Batch Input Price: $0.01 /M
Batch Output Price: $0.07 /M

OpenAI OSS

Model ID: gpt-oss-20b

text

GPT-OSS-20B

Total Context: 131K
Max Output: 16K
Std Input Price: $0.01 /M
Std Output Price: $0.06 /M
Batch Input Price: $0.01 /M
Batch Output Price: $0.03 /M

Tencent

Model ID: hunyuan-a13b

text

Hunyuan-A13B

Total Context: 131K
Max Output: 16K
Std Input Price: $0.05 /M
Std Output Price: $0.20 /M
Batch Input Price: $0.03 /M
Batch Output Price: $0.10 /M

IndexTTS

Model ID: indextts-2

audio

IndexTTS-2

Total Context: Voice synthesis
Max Output: Audio
Std Input Price: $5.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $5.00 /M
Batch Output Price: $0.00 /M

Moonshot

Model ID: kimi-k2.5

vision

Kimi K2.5

Total Context: 256K
Max Output: 64K
Std Input Price: $0.10 /M
Std Output Price: $1.00 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.50 /M

Llama 4 Maverick

Total Context: 1M
Max Output: 32K
Std Input Price: $0.06 /M
Std Output Price: $0.20 /M
Batch Input Price: $0.03 /M
Batch Output Price: $0.10 /M

Xiaomi

Model ID: mimo-v2-flash

text

MiMo-V2-Flash

Total Context: 128K
Max Output: 32K
Std Input Price: $0.01 /M
Std Output Price: $0.05 /M
Batch Input Price: $0.01 /M
Batch Output Price: $0.03 /M

MiniMax

Model ID: minimax-m2.5

vision

MiniMax M2.5

Total Context: 192K
Max Output: 64K
Std Input Price: $0.10 /M
Std Output Price: $0.40 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.20 /M

NVIDIA

Model ID: nemotron-3-super

text

Nemotron 3 Super

Total Context: 1M
Max Output: 32K
Std Input Price: $0.04 /M
Std Output Price: $0.15 /M
Batch Input Price: $0.02 /M
Batch Output Price: $0.07 /M

Baidu

Model ID: paddleocr-vl-1.5

vision

PaddleOCR-VL-1.5

Total Context: Document OCR
Max Output: Structured OCR
Std Input Price: $0.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

Alibaba

Model ID: qwen3-32b

text

Qwen3-32B

Total Context: 256K
Max Output: 32K
Std Input Price: $0.02 /M
Std Output Price: $0.08 /M
Batch Input Price: $0.01 /M
Batch Output Price: $0.04 /M

Alibaba

Model ID: qwen3-tts

audio

Qwen3-TTS

Total Context: Voice synthesis
Max Output: Audio
Std Input Price: $5.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $5.00 /M
Batch Output Price: $0.00 /M

Alibaba

Model ID: qwen3-vl-32b

vision

Qwen3-VL-32B

Total Context: 262K
Max Output: 32K
Std Input Price: $0.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

Alibaba

Model ID: qwen3.5-122b

text

Qwen3.5-122B-A10B

Total Context: 256K
Max Output: 32K
Std Input Price: $0.08 /M
Std Output Price: $0.55 /M
Batch Input Price: $0.04 /M
Batch Output Price: $0.28 /M

Alibaba

Model ID: qwen3.5-27b

text

Qwen3.5-27B

Total Context: 256K
Max Output: 32K
Std Input Price: $0.07 /M
Std Output Price: $0.50 /M
Batch Input Price: $0.04 /M
Batch Output Price: $0.25 /M

Alibaba

Model ID: qwen3.5-35b

text

Qwen3.5-35B-A3B

Total Context: 256K
Max Output: 32K
Std Input Price: $0.06 /M
Std Output Price: $0.45 /M
Batch Input Price: $0.03 /M
Batch Output Price: $0.23 /M

Alibaba

Model ID: qwen3.5-397b

text

Qwen3.5-397B-A17B

Total Context: 256K
Max Output: 64K
Std Input Price: $0.10 /M
Std Output Price: $0.05 /M
Batch Input Price: $0.05 /M
Batch Output Price: $0.03 /M

Alibaba

Model ID: qwen3.5-4b

text

Qwen3.5-4B

Total Context: 256K
Max Output: 16K
Std Input Price: $0.00 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.00 /M
Batch Output Price: $0.00 /M

Alibaba

Model ID: qwen3.5-9b

text

Qwen3.5-9B

Total Context: 256K
Max Output: 32K
Std Input Price: $0.05 /M
Std Output Price: $0.40 /M
Batch Input Price: $0.03 /M
Batch Output Price: $0.20 /M

StepFun

Model ID: step-3.5-flash

text

Step-3.5-Flash

Total Context: 128K
Max Output: 32K
Std Input Price: $0.01 /M
Std Output Price: $0.04 /M
Batch Input Price: $0.01 /M
Batch Output Price: $0.02 /M

Wan

Model ID: wan-2.2-i2v

video

Wan2.2-I2V

Total Context: Image + prompt
Max Output: 5s clip
Std Input Price: $0.15 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.15 /M
Batch Output Price: $0.00 /M

Wan

Model ID: wan-2.2

video

Wan2.2-T2V

Total Context: Prompt + reference
Max Output: 5s clip
Std Input Price: $0.15 /M
Std Output Price: $0.00 /M
Batch Input Price: $0.15 /M
Batch Output Price: $0.00 /M

Pricing Calculator

Estimate your cost by model, usage, and competitor.

ModelInput Tokens per month (M)

50M

Output Tokens per month (M)

50M

Compare Against

BatchIn

$50.00

≈ ¥360.00

SiliconFlow

$63.25

≈ ¥455.40

Savings

$13.25

21%

Monthly Cost Bar Comparison

BatchIn$50.00

SiliconFlow$63.25

Competitor comparison and average savings are based on configured data. Avg savings: 21%

Dedicated GPU Rental

Reserve high-performance GPUs monthly for stable high-load inference and training.

Dedicated isolated resources with predictable performance
Supports 24/7 long-running jobs and high-throughput batch workloads
Integrates with model scheduling and VaaS audit

Explore Dedicated GPU Plans

What You Can Build

Build differentiated products around uncensored inference, batch processing, verifiable AI, multimodal workflows, and direct GPU control.

Uncensored Agents

Build research, red-team, creative, and workflow agents without adding hidden content-filter layers.

Batch Processing

Process millions of documents with 3-tier priority scheduling and a fill path optimized for the lowest-cost offline throughput.

Verifiable Inference™

Proof at every token. Every call creates an Ed25519-signed record with hash-chain linkage that can be verified in the browser.

Multi-modal

Cover text, code, image, video, speech, and embeddings from one platform instead of stitching together multiple backends.

Agent Payments

Design agent-native payment flows around USDC today and x402-style micropayments as the stack matures.

GPU Leasing

Lease dedicated GPU capacity with SSH root access so your team keeps the runtime, model stack, and operating rules.

Contact Us

Keep your focus on creating and building. Leave the rest to us.

Connect ideas, tasks, and execution in one continuous flow.

BatchIn Live

Multimodal generation: support chat, image, voice, and video creation workflows.

AI assistants: suitable for customer support, collaboration, documents, and data scenarios.

Agent deployment: stable, secure, and controllable endpoints for production agents.

Software development: accelerate coding across generation, completion, edits, and understanding.

Knowledge retrieval: connect private knowledge bases with real-time information for better accuracy.

Intelligent search: stronger retrieval, summarization, answering, and recommendation capabilities.

Upcoming Events

Join our next hackathon, webinar, or build challenge

Hackathon

Upcoming

AI InferenceNo Filters. No Limits_

Get Started in 3 Steps

Sign Up & Get API Key

Change base_url

Start Inferencing

Switch to BatchIn in one line

GLM-5.1

GLM-5

BGE-M3

CosyVoice2-0.5B

DeepSeek R1

DeepSeek V3

DeepSeek V3.1 Terminus

DeepSeek V3.2

Devstral 2

ERNIE 4.5-300B

Fish-Speech-1.5

FLUX.1-dev

Gemma 4 27B

GLM-4.7

GPT-OSS-120B

GPT-OSS-20B

Hunyuan-A13B

IndexTTS-2

Kimi K2.5

Llama 4 Maverick

MiMo-V2-Flash

MiniMax M2.5

Nemotron 3 Super

PaddleOCR-VL-1.5

Qwen3-32B

Qwen3-TTS

Qwen3-VL-32B

Qwen3.5-122B-A10B

Qwen3.5-27B

Qwen3.5-35B-A3B

Qwen3.5-397B-A17B

Qwen3.5-4B

Qwen3.5-9B

Step-3.5-Flash

Wan2.2-I2V

Wan2.2-T2V

Pricing Calculator

Dedicated GPU Rental

Uncensored Agents

Batch Processing

Verifiable Inference™

Multi-modal

Agent Payments

GPU Leasing

Contact Us

BatchIn × GLM-5.1 + DeepSeek V4 Hackathon @ Boston Tech Week

Slash Your AI Inference Cost by 50%

Build Challenge: Best AI Agent in 7 Days

AI InferenceNo Filters. No Limits