ResNet: Skip Connections Go Deep
Residuals that beat vanishing gradients.
Deeper Got Worse
Surprisingly, plain nets past a point trained worse as they got deeper. More layers should not hurt, yet they did. Something was broken.
The Real Culprit
Gradients shrank as they flowed back through many layers, a problem called vanishing gradients. Deep nets stopped learning.
All lessons in this course
- LeNet & AlexNet: The First Wins
- VGG: Stacks of Small Filters
- ResNet: Skip Connections Go Deep
- Load torchvision Models