AlphaFold 3

google DENSE text creative-commons-by-nc-4.0 2024-05-08

Architecture

Total params
0.21 B
Active params
0.21 B
Layers
48
Context
5 k

Detailed specs

Hidden size
384
FFN size
1536
Attention heads
16
KV heads
16
Head dim
24
Vocab size
64
Attention type
pair-bias-attention

Operator decomposition (per token)

Operator FLOPs / token Bytes / token
pair-bias-attention 2.40e+9 8.00e+8
matmul 1.20e+9 4.00e+8
triangle-multiplication 6.00e+8 6.00e+8
rmsnorm 8.00e+6 4.00e+6

Compatible hardware