
Term Frequency and Inverse Document Frequency

The two halves of the score.

Two Halves of One Score

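TF-IDF is the product of two pieces: term frequency (TF) and inverse document frequency (IDF). Each half fixes a different weakness of raw counts: TF captures how prominent a word is within one document, while IDF down-weights words that appear in most documents and so say little about any one of them.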
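To make the multiplication concrete, here is a minimal sketch of one common textbook variant, assuming whitespace-split tokens, TF as a share of the document's length, and IDF as the natural log of (number of documents / number of documents containing the word). The function name `tf_idf` and the toy corpus are illustrative only; libraries often use smoothed or normalized variants, so their numbers may differ.

```python
import math
from collections import Counter

def tf_idf(term, doc_tokens, corpus_tokens):
    """Score one term in one document as TF * IDF (one common variant)."""
    if not doc_tokens:
        return 0.0
    # TF: share of this document's tokens that are the term (assumed normalization).
    tf = Counter(doc_tokens)[term] / len(doc_tokens)
    # Document frequency: how many documents in the corpus contain the term.
    df = sum(1 for tokens in corpus_tokens if term in tokens)
    if df == 0:
        return 0.0
    # IDF: unsmoothed log ratio (assumed variant); it is 0 when every document has the term.
    idf = math.log(len(corpus_tokens) / df)
    return tf * idf

# Hypothetical toy corpus, tokenized by splitting on whitespace.
corpus = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]

score_the = tf_idf("the", corpus[0], corpus)  # frequent in the document, but in every document
score_mat = tf_idf("mat", corpus[0], corpus)  # rarer in the document, but unique to it
```

In this toy corpus "the" scores zero despite being the most frequent word in the first document, because it appears in all three documents, while "mat" gets a positive score because only one document contains it.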

Term Frequency, Plainly

Term frequency measures how often a word appears within a single document. The more often a word is mentioned, the more likely the document is about that word's topic.
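As a rough illustration, term frequency can be computed by counting tokens. The sketch below assumes simple whitespace tokenization and shows two variants: the raw count, and the count divided by document length (a normalization chosen here for illustration so that long documents do not automatically score higher). The helper names `raw_counts` and `relative_frequency` are hypothetical.

```python
from collections import Counter

def raw_counts(tokens):
    """Term frequency as a plain count of each word in one document."""
    return Counter(tokens)

def relative_frequency(tokens):
    """Term frequency as each word's share of the document's tokens (assumed normalization)."""
    total = len(tokens)
    if total == 0:
        return {}
    return {word: count / total for word, count in Counter(tokens).items()}

# Hypothetical example document, tokenized by splitting on whitespace.
doc = "the cat sat on the mat".split()

counts = raw_counts(doc)
shares = relative_frequency(doc)
```

Here `counts["the"]` is 2 and `shares["the"]` is 2/6, which shows the weakness that the second half of the score addresses: the most frequent word in the document is also the least informative one.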
