Why Rare Classes Get Ignored
The cost of skewed labels.
Imbalance, Defined
When one label hugely outnumbers another, your data is imbalanced. Think 9,500 normal emails versus 500 spam ones.
The Lazy Shortcut
A model wants high accuracy fast. The easy win is to always predict the majority class and quietly ignore the rare one. 😬
All lessons in this course
- Why Rare Classes Get Ignored
- Resampling and Class Weights
- Choosing Threshold and Metric
- End-to-End Imbalanced Pipeline