0PricingLogin
Deep Learning Academy · Lesson

The Chain Rule, Layer by Layer

Compose derivatives through the network.

A Net Is Nested Functions

A neural network is just functions wrapped inside functions: each layer feeds its output into the next. Backprop needs the derivative of that whole stack.

The Chain Rule in One Line

The chain rule says the derivative of nested functions multiplies the derivatives of each piece. That single fact is the engine behind all of backprop. ⛓️

dy/dx = dy/du * du/dx

All lessons in this course

  1. The Chain Rule, Layer by Layer
  2. Forward Caches, Backward Reuses
  3. Backprop a Tiny Net by Hand
  4. Vanishing & Exploding Gradients
← Back to Deep Learning Academy