PRICING / TCO
$ / M tokens leaderboard
TCO efficiency per accelerator, recomputed on every build from the live case corpus
Formula
$/M tokens = (hw_rent_per_hour + tdp_w × PUE / 1000 × kWh_price) × 1,000,000 / (decode_tok_s_per_card × 3600)
assumptions:
hw_rent_per_hour = $2.50 USD / card / hour
kWh_price = $0.10 USD / kWh
PUE = $1.3
TDP = vendor-rated, per hardware
decode_tok_s = measured (Tier 0 case) ⚠ Compute-only BoM estimate — excludes datacenter amortization, networking, ops, licensing. Real production $/M tokens are typically 1.5–3× of this. Use for relative ranking, not absolute procurement quotes.
Best cost per card (18 cards with measured data)
| # | Hardware | Best $/M | Median | Worst | cases | Best case |
|---|---|---|---|---|---|---|
| 1 🏆 | NVIDIA H100 SXM5 80GB | $0.42 | $1.37 | $3.11 | 3 | detail → |
| 2 🏆 | NVIDIA H200 SXM 141GB | $1.20 | $2.40 | $2.40 | 2 | detail → |
| 3 🏆 | AMD Instinct MI355X | $1.32 | $1.32 | $1.32 | 1 | detail → |
| 4 | AMD Instinct MI325X | $1.89 | $1.89 | $1.89 | 1 | detail → |
| 5 | Intel Gaudi 3 | $2.01 | $2.01 | $2.01 | 1 | detail → |
| 6 | AMD Instinct MI300X | $2.62 | $2.62 | $2.62 | 1 | detail → |
| 7 | NVIDIA A100 SXM4 80GB | $3.83 | $3.83 | $3.83 | 1 | detail → |
| 8 | NVIDIA L40S | $4.88 | $4.88 | $4.88 | 1 | detail → |
| 9 | 沐曦 曦云 C500 🇨🇳 | $4.88 | $4.88 | $4.88 | 1 | detail → |
| 10 | 海光 DCU K100 🇨🇳 | $6.74 | $6.74 | $6.74 | 1 | detail → |
| 11 | AWS Trainium 2 | $12.67 | $12.67 | $12.67 | 1 | detail → |
| 12 | 昇腾 910B 🇨🇳 | $13.34 | $13.34 | $13.34 | 1 | detail → |
| 13 | 寒武纪 思元 590 🇨🇳 | $14.89 | $23.57 | $23.57 | 2 | detail → |
| 14 | 壁仞 BR104 🇨🇳 | $23.51 | $23.51 | $23.51 | 1 | detail → |
| 15 | Google TPU Trillium (v6e) | $31.05 | $31.05 | $31.05 | 1 | detail → |
| 16 | 摩尔线程 MTT S4000 🇨🇳 | $35.53 | $35.53 | $35.53 | 1 | detail → |
| 17 | 天数智芯 天垓 100 🇨🇳 | $51.29 | $51.29 | $51.29 | 1 | detail → |
| 18 | 昇腾 910C 🇨🇳 | $115.16 | $115.16 | $115.16 | 1 | detail → |
All cases · sorted by $/M tokens (22)
Want to tweak the assumptions? Open the calculator →
The TCO panel surfaces every assumption ($/card/hr, TDP, etc.) and accepts custom model/hardware/parallel configs.