This repository contains the code of my 2012 thesis (http://epub.wu.ac.at/3558/1/main.pdf).
@STANDARD {,
title = "Latent Dirichlet Allocation in R",
institution = "Institute for Statistics and Mathematics, WU (Wirtschaftsuniversität Wien), Austria",
author = "Martin Ponweiser",
language = "English",
type = "Diploma thesis"
year = "2012",
url = "http://epub.wu.ac.at/id/eprint/3558"
}
Folders:
application-pnas:LaTex/Sweave/Rcode of the main part of the thesis. Also included are the PNAS abstracts corpus and metadata intmformat.dirichlet:Sweave/Rcode to generate figures of the beta and Dirichlet distributionsweb-scraping: aPython/Scrapy/Rproject to web-scrape a corpus of PNAS journal abstracts