NLP Academy · Lesson

Tokens, Spans, and Doc Objects

Navigate spaCy's data structures.

Three Core Objects

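spaCy gives you three building blocks: the Doc (the whole processed text), the Token (a single word, number, or punctuation mark), and the Span (a slice of consecutive tokens from a Doc). Most of what you do with the library goes through one of these three objects.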
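To make the three objects concrete, here is a minimal sketch of how each one can be obtained from the same piece of text. It assumes a blank English pipeline created with spacy.blank("en"), which only tokenizes and needs no model download; the lesson itself may use a trained pipeline instead, and the slice doc[3:5] is just an illustrative choice.

```python
import spacy
from spacy.tokens import Doc, Span, Token

# Assumption: a blank English pipeline (tokenizer only), so no model download is needed.
nlp = spacy.blank("en")

# Processing text returns a Doc.
doc = nlp("Maria leads the data team.")

# Indexing a Doc with an integer returns a Token.
token = doc[0]

# Slicing a Doc returns a Span (here the fourth and fifth tokens).
span = doc[3:5]

print(type(doc).__name__, type(token).__name__, type(span).__name__)
print(token.text)
print(span.text)
```

Indexing and slicing follow ordinary Python sequence conventions, so a Span's end index is exclusive.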

The Doc Container

A Doc is the full processed document. It holds the original text plus every token and all the analysis spaCy produced.

# Assumes nlp is an already loaded pipeline, e.g. nlp = spacy.load("en_core_web_sm")
doc = nlp("Maria leads the data team.")
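As a rough illustration of what "holds the original text plus every token" means, the sketch below inspects a Doc built from the same sentence. It again assumes a blank English pipeline from spacy.blank("en"), so only tokenization is available; annotations such as part-of-speech tags or entities would require a trained pipeline.

```python
import spacy

# Assumption: a blank English pipeline (tokenizer only), used here so the
# example runs without downloading a trained model.
nlp = spacy.blank("en")
doc = nlp("Maria leads the data team.")

# The Doc keeps the original text unchanged.
print(doc.text)

# len() counts tokens, not characters; punctuation is a token of its own.
print(len(doc))

# Iterating a Doc yields Token objects in order.
# token.i is the token's position in the Doc, token.idx its character offset.
for token in doc:
    print(token.i, token.text, token.idx)

# Tokenization is non-destructive: joining each token's text with its
# trailing whitespace rebuilds the original string.
rebuilt = "".join(token.text_with_ws for token in doc)
print(rebuilt == doc.text)
```

Because the character offsets and whitespace are stored on each token, the original text can always be recovered from the Doc.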
