Why Models Need Numbers, Not Words
The leap from text to vectors.
Machines Speak Numbers
A model cannot do math on the word cat. Every algorithm under the hood only understands numbers, so text must be converted first.
The Core Problem
Your goal is to turn each document into a row of numbers, a vector, that captures what the text contains.
All lessons in this course
- Why Models Need Numbers, Not Words
- Building a Vocabulary
- Counting With CountVectorizer
- Reading the Document-Term Matrix