SWE-Bench Pro #1
Open-source coding flagship built for long-horizon autonomous engineering and deep reasoning.
- Release
- Apr 07, 2026
- Max Output
- 128K
- Pricing
- $0.500 / $1.500
Cached Input: $0.175
MIT, Open Source, reasoning, coding, featured

Lower-cost GLM route for production reasoning, agents, and long-context workflows.
- Release
- 2026
- Max Output
- 64K
- Pricing
- $0.350 / $0.900
Cached Input: $0.122
Apache 2.0, Open Source, reasoning, workflow

SWE-bench 73.8%
Mid-tier GLM reasoning route for engineering teams that need quality below flagship spend.
- Release
- 2026
- Max Output
- 64K
- Pricing
- $0.150 / $0.800
Cached Input: $0.052
Apache 2.0, Open Source, reasoning, coding

o1-class reasoning
Heavy reasoning model for difficult planning, math, research, and multi-step analysis.
- Release
- 2025
- Max Output
- 64K
- Pricing
- $0.180 / $0.600
Cached Input: $0.063
MIT, Open Source, reasoning, math, research

IMO + IOI gold
Flagship DeepSeek release tuned for strong general reasoning at a very aggressive price point.
- Release
- 2026
- Max Output
- 64K
- Pricing
- $0.100 / $0.150
Cached Input: $0.035
MIT, Open Source, reasoning, featured
DeepSeek
deepseek-v3.1-terminus
DeepSeek V3.1 Terminus
Live
Higher-output DeepSeek route for workflows that need longer structured completions.
- Release
- 2026
- Max Output
- 64K
- Pricing
- $0.100 / $0.350
Cached Input: $0.035
MIT, Open Source, reasoning, workflow

Stable general-purpose DeepSeek route for large-scale chat and batch workloads.
- Release
- 2025
- Max Output
- 64K
- Pricing
- $0.080 / $0.280
Cached Input: $0.028
MIT, Open Source, chat, batch

Balanced mid-large Qwen route for general chat, coding, and production assistant workloads.
- Release
- 2026
- Max Output
- 32K
- Pricing
- $0.020 / $0.080
Cached Input: $0.007
Apache 2.0, Open Source, qwen, general

201 languages
Top-tier Qwen MoE model for multilingual reasoning, coding, and large-context assistants.
- Release
- 2026
- Max Output
- 64K
- Pricing
- $0.100 / $0.050
Cached Input: $0.035
Apache 2.0, Open Source, multilingual, reasoning

Balanced Qwen MoE for long-context assistants and cost-conscious production routing.
- Release
- 2026
- Max Output
- 32K
- Pricing
- $0.080 / $0.550
Cached Input: $0.028
Apache 2.0, Open Source, qwen, long-context

Lower-cost MoE Qwen route for product copilots and high-volume assistant traffic.
- Release
- 2026
- Max Output
- 32K
- Pricing
- $0.060 / $0.450
Cached Input: $0.021
Apache 2.0, Open Source, qwen, moe

Exceeds GPT-5-mini
Lean Qwen route aimed at lower-cost chat, agent routing, and product copilot features.
- Release
- 2026
- Max Output
- 32K
- Pricing
- $0.070 / $0.500
Cached Input: $0.025
Apache 2.0, Open Source, qwen, mid-tier