PRICING / TCO
$ / M tokens ranking
Cost efficiency per card is computed automatically from measured cases; the ranking updates as the case library grows.
Formula
$/M tokens = (hw_rent_per_hour + tdp_w × PUE / 1000 × kWh_price) × 1,000,000 / (decode_tok_s_per_card × 3600)
assumptions:
hw_rent_per_hour = $2.50 USD / card / hour
kWh_price = $0.10 USD / kWh
PUE = 1.3 (dimensionless)
tdp_w = vendor-rated TDP in watts, per hardware
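decode_tok_s_per_card = measured (Tier 0 case)
⚠ This is an inference-only BoM estimate: it excludes data center amortization, networking, operations, licenses, and similar costs. Real production $/M tokens is typically 1.5-3× this figure. Use it for side-by-side comparison, not as an absolute procurement quote.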
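As a minimal sketch, the formula and default assumptions above can be written as a small Python function. The function name and the example inputs (a 700 W card decoding 1,000 tokens/s) are hypothetical and chosen only for illustration; they are not measured cases from this page.

```python
def usd_per_million_tokens(
    decode_tok_s_per_card: float,
    tdp_w: float,
    hw_rent_per_hour: float = 2.50,  # USD / card / hour (page default)
    kwh_price: float = 0.10,         # USD / kWh (page default)
    pue: float = 1.3,                # dimensionless (page default)
) -> float:
    """Return the estimated cost in USD per million decoded tokens."""
    if decode_tok_s_per_card <= 0:
        raise ValueError("decode_tok_s_per_card must be positive")
    # Energy cost per hour: watts -> kW, scaled by PUE, times price per kWh.
    energy_per_hour = tdp_w / 1000 * pue * kwh_price
    cost_per_hour = hw_rent_per_hour + energy_per_hour
    tokens_per_hour = decode_tok_s_per_card * 3600
    return cost_per_hour * 1_000_000 / tokens_per_hour


# Hypothetical card: 700 W TDP, 1,000 decode tokens/s.
example = usd_per_million_tokens(decode_tok_s_per_card=1000, tdp_w=700)
print(f"${example:.2f} / M tokens")  # $0.72 / M tokens
```

With these defaults the rental term dominates: in the example, energy adds about $0.09 per hour on top of $2.50, so the result is driven mostly by measured decode throughput.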
Best cost per card (18 cards with measured data)
| # | Hardware | Best $/M tokens | Median | Worst | Cases | Best case |
|---|---|---|---|---|---|---|
| 1 🏆 | NVIDIA H100 SXM5 80GB | $0.42 | $1.37 | $3.11 | 3 | Details → |
| 2 🏆 | NVIDIA H200 SXM 141GB | $1.20 | $2.40 | $2.40 | 2 | Details → |
| 3 🏆 | AMD Instinct MI355X | $1.32 | $1.32 | $1.32 | 1 | Details → |
| 4 | AMD Instinct MI325X | $1.89 | $1.89 | $1.89 | 1 | Details → |
| 5 | Intel Gaudi 3 | $2.01 | $2.01 | $2.01 | 1 | Details → |
| 6 | AMD Instinct MI300X | $2.62 | $2.62 | $2.62 | 1 | Details → |
| 7 | NVIDIA A100 SXM4 80GB | $3.83 | $3.83 | $3.83 | 1 | Details → |
| 8 | NVIDIA L40S | $4.88 | $4.88 | $4.88 | 1 | Details → |
| 9 | MetaX Xiyun C500 (沐曦 曦云) 🇨🇳 | $4.88 | $4.88 | $4.88 | 1 | Details → |
| 10 | Hygon DCU K100 (海光) 🇨🇳 | $6.74 | $6.74 | $6.74 | 1 | Details → |
| 11 | AWS Trainium 2 | $12.67 | $12.67 | $12.67 | 1 | Details → |
| 12 | Ascend 910B (昇腾) 🇨🇳 | $13.34 | $13.34 | $13.34 | 1 | Details → |
| 13 | Cambricon Siyuan 590 (寒武纪 思元) 🇨🇳 | $14.89 | $23.57 | $23.57 | 2 | Details → |
| 14 | Biren BR104 (壁仞) 🇨🇳 | $23.51 | $23.51 | $23.51 | 1 | Details → |
| 15 | Google TPU Trillium (v6e) | $31.05 | $31.05 | $31.05 | 1 | Details → |
| 16 | Moore Threads MTT S4000 (摩尔线程) 🇨🇳 | $35.53 | $35.53 | $35.53 | 1 | Details → |
| 17 | Iluvatar CoreX Tiangai 100 (天数智芯 天垓) 🇨🇳 | $51.29 | $51.29 | $51.29 | 1 | Details → |
| 18 | Ascend 910C (昇腾) 🇨🇳 | $115.16 | $115.16 | $115.16 | 1 | Details → |
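The best / median / worst columns can be reproduced from per-case $/M tokens values with a short aggregation. The sketch below is an assumption about how the summary might be computed, not the site's actual code. The page does not say how the median is defined; `statistics.median_high` is used here only because it matches the two-case rows, where the median equals the worst value. The per-case values are read off the H100 and H200 rows of the table, and the function name `summarize` is hypothetical.

```python
from statistics import median_high

# Per-case $/M tokens, read off the table: a 3-case row exposes all three
# values (best, median, worst); a 2-case row exposes both (best, worst).
cases_by_card = {
    "NVIDIA H200 SXM 141GB": [2.40, 1.20],
    "NVIDIA H100 SXM5 80GB": [1.37, 0.42, 3.11],
}


def summarize(cases):
    """Return one summary row per card, sorted by best (lowest) cost."""
    rows = []
    for card, values in cases.items():
        rows.append({
            "card": card,
            "best": min(values),
            # Assumption: upper median, so two cases yield the higher value.
            "median": median_high(values),
            "worst": max(values),
            "cases": len(values),
        })
    rows.sort(key=lambda row: row["best"])
    return rows


for rank, row in enumerate(summarize(cases_by_card), start=1):
    print(rank, row["card"], row["best"], row["median"], row["worst"], row["cases"])
```

Ranking by the best case rewards a card's most favorable configuration, so the median and worst columns are worth reading alongside the rank when a card has more than one case.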
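All cases · sorted by $/M tokens, ascending (22)
Want to adjust the assumptions yourself? Open the calculator → In the TCO panel, both $/card/hour and TDP are editable, and custom model, hardware, and parallelism configurations are supported.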