GPU Leasing

Lease dedicated GPU capacity with SSH access, direct control, and no platform filter layer.

For teams that need to own the runtime, BatchIn leases GPU capacity directly, so infrastructure stays out of a long procurement cycle and inside an operator-friendly workflow.

H200: $1.80/hr
A800: $1.00/hr
Access: SSH root
Deploy: Your stack

Hardware options for real workloads

Choose the accelerator profile that fits your model size, latency target, and deployment timeline.

  • H200 for top-end memory headroom and flagship model serving.
  • A800 for cost-sensitive throughput and regional deployment needs.
  • 910C pathways for teams optimizing around domestic hardware supply.
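
As a rough illustration of matching an accelerator to model size, the sketch below estimates how many GPUs are needed just to hold a model's weights. The VRAM figures come from the lineup on this page; the function name `min_gpus_for_weights` and the `usable_fraction` headroom of 0.8 are assumptions made for this example, and the estimate ignores KV cache, activations, and framework overhead, so treat the result as a lower bound rather than a sizing guarantee.

```python
import math

# VRAM per GPU in GB, taken from the lineup table on this page.
VRAM_GB = {
    "B200": 192, "H200": 141, "H100": 80, "H20": 96,
    "A800": 80, "910C": 64, "L40S": 48,
}

def min_gpus_for_weights(params_billion: float, bytes_per_param: float,
                         gpu: str, usable_fraction: float = 0.8) -> int:
    """Lower bound on GPU count needed to hold the weights alone.

    usable_fraction is an assumed headroom factor (not a BatchIn figure):
    only that share of VRAM is counted as available for weights.
    """
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB
    weights_gb = params_billion * bytes_per_param
    usable_gb = VRAM_GB[gpu] * usable_fraction
    return math.ceil(weights_gb / usable_gb)

# Example: a 70B-parameter model stored in 16-bit precision (2 bytes per parameter).
for gpu in ("H200", "A800"):
    print(gpu, min_gpus_for_weights(70, 2, gpu))
```

For long-context serving, the KV cache can take as much memory as the weights, so real deployments usually need more headroom than this estimate suggests.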

You keep the runtime

GPU leasing is for cases where serverless is the wrong abstraction. BatchIn provides capacity; your team controls the operating model.

  • SSH root access for custom runtimes, schedulers, and observability agents.
  • Bring your own model stack, checkpoints, quantization, and deployment workflow.
  • Pair leased infrastructure with BatchIn billing and audit products only when you want them.

Current GPU lineup

Reference pricing and positioning for the current leasing menu.

GPU  | $/GPU-hr | VRAM        | Architecture    | Best for                                          | Availability
B200 | TBD      | 192GB HBM3e | Blackwell       | Next-gen flagship and FP4-native workloads        | M2+
H200 | $1.80    | 141GB HBM3e | Hopper          | Large MoE and flagship inference                  | Day-1
H100 | $1.50    | 80GB HBM3   | Hopper          | Industry-standard production serving              | M1
H20  | $1.20    | 96GB HBM3   | Hopper (China)  | Long-context inference and China-optimized rollout | M1
A800 | $1.00    | 80GB HBM2e  | Ampere (China)  | Mid-size models and cost-optimized serving        | Day-1
910C | $0.80    | 64GB HBM2e  | Ascend (Huawei) | Domestic projects and lowest-cost deployment path | Day-1
L40S | $0.60    | 48GB GDDR6  | Ada Lovelace    | Image, video, and embedding inference             | M1
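
To turn the reference rates into a budget figure, a minimal sketch like the one below multiplies the listed $/GPU-hr by GPU count and hours. It assumes flat hourly billing with no minimum term, discount, or extra fees, none of which this page confirms; the function name `lease_cost` is hypothetical and is not part of any BatchIn API.

```python
# Reference rates in USD per GPU-hour, copied from the lineup table.
# B200 is omitted because its price is listed as TBD.
RATES_USD_PER_GPU_HOUR = {
    "H200": 1.80,
    "H100": 1.50,
    "H20": 1.20,
    "A800": 1.00,
    "910C": 0.80,
    "L40S": 0.60,
}

def lease_cost(gpu: str, gpu_count: int, hours: float) -> float:
    """Estimated lease cost in USD, assuming flat hourly billing."""
    if gpu not in RATES_USD_PER_GPU_HOUR:
        raise KeyError(f"no reference rate listed for {gpu!r}")
    return round(RATES_USD_PER_GPU_HOUR[gpu] * gpu_count * hours, 2)

# Example: an 8-GPU H200 node leased for a 30-day month (720 hours).
print(lease_cost("H200", 8, 720))  # prints 10368.0
```

The same call with `"A800"` gives 5760.0 for the month, which makes the price gap between the two Day-1 options concrete.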