Measuring the Speedup
Comparing naive vs tiled performance.
Prove the Win
You built a tiled kernel, but how much faster is it really? Measuring turns a guess into a number you can trust. 📊
Time on the GPU's Clock
Use CUDA events to time kernels. They sit in the GPU stream and measure exactly when work starts and finishes.
cudaEvent_t start, stop;
cudaEventCreate(&start);
cudaEventCreate(&stop);All lessons in this course
- The Naive Matmul Kernel
- Tiling the Inner Product
- Looping Over Tile Phases
- Measuring the Speedup