0Pricing
CUDA Academy · Lesson

Kernel Metrics in Nsight Compute

Throughput, stalls, and roofline data.

Zoom Into One Kernel

Nsight Compute is the microscope for a single kernel. It collects deep hardware metrics so you can see exactly why it runs slow. 🔬

How to Launch It

You profile a kernel with ncu from the command line. It replays the kernel many times to gather detailed counters.

ncu -o report ./my_cuda_app

All lessons in this course

  1. Timeline View in Nsight Systems
  2. Kernel Metrics in Nsight Compute
  3. Compute-Bound vs Memory-Bound
  4. Annotating Code with NVTX
← Back to CUDA Academy