NLP Academy · Lesson

Running LDA With Gensim

Fit topics on a real corpus.

Meet Gensim

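Gensim is an open-source Python library built for topic modeling. It makes running LDA on real text approachable: the two imports below cover what you need, with corpora building the word-to-id dictionary and models holding the LDA implementation. 🐍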

from gensim import corpora, models

Start With Tokens

Gensim expects each document as a list of clean tokens. Cleaning is your job: lowercase the text and remove stopwords before you hand the word lists to Gensim.

docs = [["price", "refund"], ["battery", "screen"]]
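If your corpus starts as raw strings, one way to get to token lists like these is sketched below. This is a minimal illustration using only the standard library, not Gensim's own preprocessing: the tokenize helper, the tiny STOPWORDS set, and the sample sentences are all assumptions made up for this example.

```python
import re

# Hypothetical, deliberately tiny stopword list; a real project would use a fuller one.
STOPWORDS = {"the", "a", "an", "is", "was", "and", "my", "i", "for", "on", "of", "to"}

def tokenize(text):
    """Lowercase the text, keep runs of letters as words, and drop stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return [w for w in words if w not in STOPWORDS]

# Made-up sample documents, one raw string per document.
raw_docs = [
    "The price was high and I asked for a refund",
    "Battery life is great; the screen is dim",
]

# One token list per document, which is the shape Gensim expects.
docs = [tokenize(text) for text in raw_docs]
print(docs)
```

The regular expression keeps only the letters a to z, so digits and punctuation are discarded; that choice is probably too aggressive for corpora where numbers or hyphenated terms carry meaning.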
