The Occupancy Calculator API
cudaOccupancyMaxPotentialBlockSize.
Stop Guessing Block Size
Instead of hand-tuning, the CUDA runtime offers an occupancy API that derives a block size from your kernel's resource usage (registers and shared memory) and the limits of the device it will run on.
The Star Function
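One call does the heavy lifting: cudaOccupancyMaxPotentialBlockSize returns the block size that gives the kernel its highest theoretical occupancy, along with the minimum grid size needed to reach that occupancy across the whole device.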
cudaOccupancyMaxPotentialBlockSize(&minGridSize, &blockSize, myKernel);
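To show the call in context, here is a minimal sketch of a complete program. The kernel body, the problem size n, and the helper blocksFor are assumptions made for illustration; only the cudaOccupancyMaxPotentialBlockSize call itself comes from the lesson. The two trailing zeros are the optional arguments for dynamic shared memory per block and an upper limit on block size (0 means no limit).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel standing in for myKernel: doubles each element in place.
__global__ void myKernel(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Illustrative helper: round-up division, the number of blocks needed to cover n elements.
int blocksFor(int n, int blockSize) {
    return (n + blockSize - 1) / blockSize;
}

int main() {
    const int n = 1 << 20;   // assumed problem size
    int minGridSize = 0;     // smallest grid that can reach maximum occupancy
    int blockSize = 0;       // suggested threads per block

    // No dynamic shared memory (0) and no block size limit (0).
    cudaError_t err = cudaOccupancyMaxPotentialBlockSize(
        &minGridSize, &blockSize, myKernel, 0, 0);
    if (err != cudaSuccess) {
        fprintf(stderr, "occupancy query failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // The launch grid is sized by the problem, not by minGridSize.
    int gridSize = blocksFor(n, blockSize);

    float *d_data = nullptr;
    err = cudaMalloc(&d_data, n * sizeof(float));
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    cudaMemset(d_data, 0, n * sizeof(float));

    myKernel<<<gridSize, blockSize>>>(d_data, n);
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
    }

    printf("suggested block size: %d, min grid size: %d, launched grid: %d\n",
           blockSize, minGridSize, gridSize);

    cudaFree(d_data);
    return err == cudaSuccess ? 0 : 1;
}
```

The printed values depend on the GPU and on how the compiler allocates registers for the kernel, so they will differ between machines. Note that the first output is a minimum grid size for saturating the device, not the grid to launch: the sketch rounds the problem size up to a whole number of blocks instead.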