HanGuangAI
维护: vendor
unknown pingtouge
T-Head/Pingtouge inference stack for HanGuang 800; details limited in public docs
7 个推理引擎 · 硬件兼容 · 维护方信息
T-Head/Pingtouge inference stack for HanGuang 800; details limited in public docs
TurboMind backend; friendly to Chinese hardware ecosystems
Huawei official inference engine for Ascend (910B/910C, CloudMatrix)
New inference engine featured in InferenceX benchmarks
High-performance serving with RadixAttention and disaggregated prefill/decode
NVIDIA-only; deepest kernel optimizations and FP8/FP4 support
Most widely used; ascend / rocm / musa forks for non-NVIDIA hardware