Term Frequency and Inverse Document Frequency
The two halves of the score.
Two Halves of One Score
TF-IDF is built from two pieces that multiply together: term frequency and inverse document frequency. Each half fixes a different weakness of raw counts.
Term Frequency, Plainly
Term frequency measures how often a word appears inside one document. More mentions of a word suggest that document leans toward that topic.
All lessons in this course
- The Problem With Raw Counts
- Term Frequency and Inverse Document Frequency
- TF-IDF With scikit-learn
- Finding the Most Important Words