Scaling and Selecting Features
Keep the features that actually help.
Too Many Features Hurt
Text models can explode to tens of thousands of features. Many add only noise, so trimming the list often improves both speed and accuracy.
Why Scale at All
Some models compare feature magnitudes directly. If one feature ranges 0 to 1000, it can drown out the rest unless you scale them first.
All lessons in this course
- Beyond Bag-of-Words
- Character N-Grams for Robustness
- Combining Multiple Feature Types
- Scaling and Selecting Features