← 厂商目录

DeepSeek

深度求索

CN model-lab 官网 ↗

模型

3

部署案例

7

模型 (3)

DeepSeek V4 Flash

284B MOE · 2026-04-24

DeepSeek V4 Pro

1600B MOE · 2026-04-24

671B MOE · 2025-01-20

部署案例 (7)

DeepSeek R1 on 16× Ascend 910B with MindIE

ascend-910b ×16 · deepseek-r1 · 850 tok/s
DeepSeek V4 Flash on 8×H100 SXM with vLLM FP8

h100-sxm5 ×8 · deepseek-v4-flash · 4200 tok/s
DeepSeek V4 Pro on Huawei CloudMatrix 384 with MindIE

ascend-910c ×384 · deepseek-v4-pro · 2400 tok/s
DeepSeek V4 Flash with disaggregated prefill (H100) + decode (H200) via Mooncake

h200-sxm ×16 · deepseek-v4-flash · 9600 tok/s
DeepSeek V4 Flash on 16× MTT S4000 (Moore Threads KUAE)

mtt-s4000 ×16 · deepseek-v4-flash · 320 tok/s
DeepSeek V3 on AWS Trainium 2 (64-chip Trn2 instance)

trainium-2 ×64 · deepseek-r1 · 3600 tok/s
DeepSeek R1 on 16× Iluvatar 天垓 100 (Iluvatar IxRT)

iluvatar-bi ×16 · deepseek-r1 · 220 tok/s