Running LDA With Gensim
Fit topics on a real corpus.
Meet Gensim
Gensim is a Python library built for topic modeling. It makes running LDA on real text fast and approachable. 🐍
from gensim import corpora, models
Start With Tokens
Gensim expects each document as a list of clean tokens. You supply word lists that are already lowercased and stripped of stopwords.
docs = [["price", "refund"], ["battery", "screen"]]