Threads, Blocks, and Grids
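The GPU execution model: how a kernel's work is split across threads, how threads are grouped into blocks, and how blocks make up a grid.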
The Smallest Worker
A GPU runs your kernel as many lightweight workers that all execute the same code in parallel. Each worker is a thread, and it typically handles one piece of data, such as a single array element.
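To make the one-thread-per-element idea concrete, here is a minimal CUDA sketch. The kernel name double_elements and the host helper double_on_gpu are hypothetical, chosen only for illustration. It assumes the array is small enough (at most 1024 elements) to be covered by a single launch group of threads, and it omits error checking for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Kernel: every thread runs this same function, each on its own element.
__global__ void double_elements(float* data, int n) {
    int i = threadIdx.x;   // this thread's index within its group
    if (i < n) {           // guard in case there are more threads than elements
        data[i] *= 2.0f;
    }
}

// Hypothetical host helper: copies the input to the GPU, launches one
// thread per element, and copies the result back.
// Assumes 1 <= host.size() <= 1024 so a single group of threads suffices.
std::vector<float> double_on_gpu(std::vector<float> host) {
    int n = static_cast<int>(host.size());
    size_t bytes = n * sizeof(float);

    float* dev = nullptr;
    cudaMalloc(&dev, bytes);
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    // Launch n threads in one group; thread i handles element i.
    double_elements<<<1, n>>>(dev, n);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);
    return host;
}

int main() {
    std::vector<float> out = double_on_gpu({1.0f, 2.0f, 3.0f, 4.0f});
    for (float v : out) printf("%.1f ", v);
    printf("\n");
    return 0;
}
```

Note that the kernel contains no loop over the array: the parallelism comes from launching many threads, each of which picks its element from its own index.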
Grouping Threads
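Threads are organized into blocks. Threads in the same block can synchronize with one another and share fast on-chip memory, while threads in different blocks cannot cooperate this way. All the blocks launched for one kernel together form the grid.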
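As a sketch of what cooperation within a block can look like, the CUDA example below has each block add up its own slice of an array using shared on-chip memory. The names block_sum, sum_on_gpu, and kThreadsPerBlock are hypothetical, and the block size of 256 is an assumption (the reduction loop needs a power of two). Error checking is omitted for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int kThreadsPerBlock = 256;  // assumed block size; must be a power of two

// Each block sums its own slice of `in` and writes one partial sum.
// Must be launched with exactly kThreadsPerBlock threads per block.
__global__ void block_sum(const float* in, float* block_sums, int n) {
    __shared__ float scratch[kThreadsPerBlock];  // visible to this block only

    int tid = threadIdx.x;                       // index within the block
    int i = blockIdx.x * blockDim.x + tid;       // index within the whole grid
    scratch[tid] = (i < n) ? in[i] : 0.0f;       // pad past-the-end slots with 0
    __syncthreads();                             // wait until every slot is filled

    // Tree reduction: halve the number of active threads each step.
    for (int stride = blockDim.x / 2; stride > 0; stride /= 2) {
        if (tid < stride) {
            scratch[tid] += scratch[tid + stride];
        }
        __syncthreads();                         // all adds done before next step
    }
    if (tid == 0) {
        block_sums[blockIdx.x] = scratch[0];
    }
}

// Hypothetical host helper: launches enough blocks to cover the input,
// then adds the per-block partial sums on the CPU. Assumes a non-empty input.
float sum_on_gpu(const std::vector<float>& host) {
    int n = static_cast<int>(host.size());
    int blocks = (n + kThreadsPerBlock - 1) / kThreadsPerBlock;  // round up

    float* dev_in = nullptr;
    float* dev_sums = nullptr;
    cudaMalloc(&dev_in, n * sizeof(float));
    cudaMalloc(&dev_sums, blocks * sizeof(float));
    cudaMemcpy(dev_in, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    block_sum<<<blocks, kThreadsPerBlock>>>(dev_in, dev_sums, n);

    std::vector<float> partial(blocks);
    cudaMemcpy(partial.data(), dev_sums, blocks * sizeof(float),
               cudaMemcpyDeviceToHost);          // also waits for the kernel
    cudaFree(dev_in);
    cudaFree(dev_sums);

    float total = 0.0f;
    for (float p : partial) total += p;
    return total;
}

int main() {
    std::vector<float> ones(1000, 1.0f);
    printf("sum = %.1f\n", sum_on_gpu(ones));
    return 0;
}
```

The __syncthreads() barrier only coordinates threads within one block, which is why the per-block partial sums are combined in a separate step on the host rather than inside the kernel.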
All lessons in this course
- Why GPUs for AI Workloads
- Threads, Blocks, and Grids
- Writing a GPU Kernel Function
- Moving Data to and from Device