0Pricing
NLP Academy · Lesson

What Is a Token, Really?

Words, punctuation, and the unit of NLP.

Meet the Token

A token is the smallest unit of text your code works with. Before any analysis, you chop a sentence into these tiny pieces. 🧩

Usually a Word

Most of the time a token is just a word. The sentence I love cats becomes three tokens: I, love, and cats.

All lessons in this course

  1. What Is a Token, Really?
  2. Splitting on Whitespace and Its Limits
  3. Sentence Segmentation Basics
  4. Tokenizing With NLTK
← Back to NLP Academy