0Pricing
CUDA Academy · Lesson

Launching Kernels from a Kernel

Device-side recursion and refinement.

Kernels Launching Kernels

With dynamic parallelism, a running GPU kernel can launch another kernel itself, no trip back to the CPU required. 🚀

Same Syntax, Device Side

The launch looks identical to a host launch: the familiar triple angle brackets work right inside device code.

__global__ void child(int* d);
__global__ void parent(int* d) {
    child<<<1, 32>>>(d);
}

All lessons in this course

  1. Launching Kernels from a Kernel
  2. When Dynamic Parallelism Pays
  3. Capturing Work into a Graph
  4. Replaying Graphs to Cut Overhead
← Back to CUDA Academy