AlphaFold 3

google DENSE text creative-commons-by-nc-4.0 2024-05-08

架构

Total params
0.21 B
Active params
0.21 B
Layers
48
Context
5 k

详细规格

Hidden size
384
FFN size
1536
Attention heads
16
KV heads
16
Head dim
24
Vocab size
64
Attention type
pair-bias-attention

算子拆解 (per token)

算子 FLOPs / token Bytes / token
pair-bias-attention 2.40e+9 8.00e+8
matmul 1.20e+9 4.00e+8
triangle-multiplication 6.00e+8 6.00e+8
rmsnorm 8.00e+6 4.00e+6

兼容硬件