GraphCast

google DENSE text creative-commons-by-nc-sa-4.0 2023-11-14

架构

Total params
0.0367 B
Active params
0.0367 B
Layers
16
Context
1014 k

详细规格

Hidden size
512
FFN size
1024
Attention heads
8
KV heads
8
Head dim
64
Vocab size
256
Attention type
graph-message-passing

算子拆解 (per token)

算子 FLOPs / token Bytes / token
graph-message-passing 2.80e+8 2.80e+8
matmul 1.80e+8 6.00e+7
scatter-gather 4.00e+7 2.00e+8
layernorm 4.00e+6 2.00e+6

兼容硬件