DeepSeek
深度求索
模型
3
部署案例
7
模型 (3)
部署案例 (7)
- DeepSeek R1 on 16× Ascend 910B with MindIEascend-910b ×16 · deepseek-r1 · 850 tok/s
- DeepSeek V4 Flash on 8×H100 SXM with vLLM FP8h100-sxm5 ×8 · deepseek-v4-flash · 4200 tok/s
- DeepSeek V4 Pro on Huawei CloudMatrix 384 with MindIEascend-910c ×384 · deepseek-v4-pro · 2400 tok/s
- DeepSeek V4 Flash with disaggregated prefill (H100) + decode (H200) via Mooncakeh200-sxm ×16 · deepseek-v4-flash · 9600 tok/s
- DeepSeek V4 Flash on 16× MTT S4000 (Moore Threads KUAE)mtt-s4000 ×16 · deepseek-v4-flash · 320 tok/s
- DeepSeek V3 on AWS Trainium 2 (64-chip Trn2 instance)trainium-2 ×64 · deepseek-r1 · 3600 tok/s
- DeepSeek R1 on 16× Iluvatar 天垓 100 (Iluvatar IxRT)iluvatar-bi ×16 · deepseek-r1 · 220 tok/s