The Vanishing Gradient Problem
Why plain RNNs forget.
A Frustrating Limit
Plain RNNs look powerful, but they struggle to remember long-range context. The cause is the vanishing gradient problem.
Learning Means Gradients
Networks learn by sending error signals, called gradients, backward through time to nudge each weight in the right direction.
All lessons in this course
- Why Order Matters in Language
- How an RNN Reads a Sequence
- Building an RNN Text Model
- The Vanishing Gradient Problem