Llama 4 Maverick

meta MOE text vision video Llama-4-Community 2025-04-05

架构

Total params
400 B
Active params
17 B
Layers
48
Context
1024 k

详细规格

Hidden size
5120
FFN size
16384
Attention heads
40
KV heads
8
Head dim
128
Vocab size
200000
Attention type
gqa
MoE experts
128
MoE top-k
1
Expert hidden
8192

算子拆解 (per token)

算子 FLOPs / token Bytes / token
matmul 2.42e+10 2.42e+10
attention 7.05e+9 1.09e+10
moe-gate 3.15e+7 4.03e+9
rmsnorm 2.46e+6 9.83e+5

兼容硬件