AlphaFold 3
google DENSE text creative-commons-by-nc-4.0 2024-05-08
Architecture
Total params
0.21 B
Active params
0.21 B
Layers
48
Context
5 k
Detailed specs
Hidden size
384
FFN size
1536
Attention heads
16
KV heads
16
Head dim
24
Vocab size
64
Attention type
pair-bias-attention
Operator decomposition (per token)
| Operator | FLOPs / token | Bytes / token |
|---|---|---|
| pair-bias-attention | 2.40e+9 | 8.00e+8 |
| matmul | 1.20e+9 | 4.00e+8 |
| triangle-multiplication | 6.00e+8 | 6.00e+8 |
| rmsnorm | 8.00e+6 | 4.00e+6 |
Compatible hardware
— unknown AMD Instinct MI300A — unknown AMD Instinct MI300X — unknown AMD Instinct MI325X — unknown AMD Instinct MI355X — unknown Apple M4 Max Neural Engine — unknown AWS Inferentia 2 — unknown AWS Trainium 2 — unknown 壁仞 BR100 — unknown 壁仞 BR104 — unknown 寒武纪 MLU370-X8 — unknown 寒武纪 思元 590 — unknown Cerebras WSE-3 — unknown 燧原 云燧 T21 — unknown Etched Sohu — unknown Google TPU v5p — unknown Google TPU Trillium (v6e) — unknown Groq LPU (TSP v1) — unknown 昇腾 910B — unknown 昇腾 910C — unknown 昇腾 950 — unknown 海光 DCU K100 — unknown 海光 DCU Z100 — unknown 天数智芯 天垓 100 — unknown Intel Gaudi 2 — unknown Intel Gaudi 3 — unknown 沐曦 曦云 C500 — unknown 摩尔线程 MTT S4000 — unknown NVIDIA A100 SXM4 80GB — unknown NVIDIA B200 SXM 180GB — unknown NVIDIA B300 SXM 288GB — unknown NVIDIA GB200 NVL72 — unknown NVIDIA GB300 NVL72 — unknown NVIDIA H100 SXM5 80GB — unknown NVIDIA H200 SXM 141GB — unknown NVIDIA L40S — unknown NVIDIA R200 SXM (Vera Rubin) — unknown 平头哥 含光 800 — unknown SambaNova SN40L — unknown Tenstorrent Wormhole n300