diff --git a/lectures/memory-hierarchies/slides.qmd b/lectures/memory-hierarchies/slides.qmd
index ba9f8509bd6119170c3d9bf301b4f85e837cb284..37a2eb63c973fe8da2eb5265fa2965be141e3606 100644
--- a/lectures/memory-hierarchies/slides.qmd
+++ b/lectures/memory-hierarchies/slides.qmd
@@ -492,6 +492,20 @@
 
 $\approx$ Factor 20 between best and worst access
 
+## Memory on Levante GPUs
+
+ - For an NVIDIA A100 80GB GPU (4x in a Levante GPU node)
+ - Register and L1 Cache for one (of 108) Streaming Multiprocessor of a GPU
+
+|                      | Latency      | Capacity   |
+| -------------------- | ------------ | ---------- |
+| Register             | ~1 ns        | 4 x 64 KB  |
+| L1 Cache             | a few ns     | 192 KB     |
+| L2 Cache (shared)    | ~10 ns       | 40 MB      |
+| Main Memory (HBM2e)  | 10s of ns    | 80 GB      |
+
+
+
 # Summary
 
 ## Observations