0PricingLogin
CUDA Academy · Lesson

Sequential Addressing

Conflict-free strides in shared memory.

Shared Memory Has Banks

Shared memory is split into 32 banks, one per warp lane. When 32 threads hit 32 different banks, all reads happen in a single fast cycle.

Bank Conflicts Slow You Down

If two threads in a warp touch the same bank, that is a bank conflict. The hardware serializes those accesses, costing extra cycles.

All lessons in this course

  1. The Reduction Tree Idea
  2. Killing Warp Divergence
  3. Sequential Addressing
  4. Multi-Block Final Reduction
← Back to CUDA Academy