This is the code for "Generative Models - The Math of Intelligence #8" by Siraj Raval on YouTube
Write a script to perform Latent Dirichlet Allocation (LDA) on a text dataset of your choice. Bonus points if it solves a real-world problem. Sample datasets here. Good luck!
This is the code for this video on YouTube by Siraj Raval, part of The Math of Intelligence course. I have two implementations of LDA here: the Jupyter notebook uses helper libraries, and the plain Python script is written from scratch.
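To give a feel for what a from-scratch LDA involves, here is a minimal sketch of collapsed Gibbs sampling using only the Python standard library. It is not the code from LDA.py or the notebook; the function name lda_gibbs, the toy corpus, and the hyperparameter defaults (alpha, beta, n_iter) are assumptions chosen for illustration.

```python
import random


def lda_gibbs(docs, n_topics, vocab_size, n_iter=100, alpha=0.1, beta=0.01, seed=0):
    """Illustrative collapsed Gibbs sampler for LDA (hypothetical helper).

    docs is a list of documents, each a list of integer word ids in
    [0, vocab_size). Returns (theta, phi): per-document topic proportions
    and per-topic word distributions, both as nested lists.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                 # tokens per (doc, topic)
    topic_word = [[0] * vocab_size for _ in range(n_topics)]   # tokens per (topic, word)
    topic_total = [0] * n_topics                               # tokens per topic
    assignments = []                                           # topic of every token

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1

                # Unnormalised conditional probability of each topic for this token.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]

                # Sample a new topic and add the token back.
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    # Smoothed estimates from the final counts.
    theta = [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in doc_topic[d]]
        for d, doc in enumerate(docs)
    ]
    phi = [
        [(c + beta) / (topic_total[k] + vocab_size * beta) for c in topic_word[k]]
        for k in range(n_topics)
    ]
    return theta, phi


# Toy corpus (made up for this example), mapped to integer word ids.
corpus = ["cat dog cat pet", "dog pet cat", "stock market trade", "market trade stock stock"]
tokens = [text.split() for text in corpus]
vocab = sorted({w for doc in tokens for w in doc})
word_id = {w: i for i, w in enumerate(vocab)}
docs = [[word_id[w] for w in doc] for doc in tokens]

theta, phi = lda_gibbs(docs, n_topics=2, vocab_size=len(vocab))
for text, row in zip(corpus, theta):
    print(text, [round(p, 2) for p in row])
```

Each row of theta is one document's topic mixture and each row of phi is one topic's word distribution. On a corpus this small the topics found depend on the seed, so treat the printed numbers as a smoke test rather than a result.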
- nltk
- numpy
Install any missing dependencies using pip (for example, pip install nltk numpy).
Run jupyter notebook in a terminal to open the code in your browser. Install Jupyter here if you haven't already. For the plain Python version, edit the dataset path in the code to point to your local dataset, then run python LDA.py
Credits for this code go to moreene. I've merely created a wrapper to get people started.