The Thread Hierarchy
Threads group into blocks, blocks into a grid.
Many Threads, One Job
A CUDA kernel runs in parallel across many threads, and each thread does the same work on its own slice of the data. 🧵
Threads Need Structure
Launching millions of loose threads would be chaos. CUDA organizes them into a clean hierarchy so the hardware can schedule them well.
All lessons in this course
- The Thread Hierarchy
- threadIdx, blockIdx, blockDim
- Why Blocks Exist
- Choosing Threads per Block