mponweiser/thesis-LDA-in-R

Latent Dirichlet Allocation in R

This repository contains the code of my 2012 thesis (http://epub.wu.ac.at/3558/1/main.pdf).

@STANDARD {,
    title       = "Latent Dirichlet Allocation in R",
    institution = "Institute for Statistics and Mathematics, WU (Wirtschaftsuniversität Wien), Austria",
    author      = "Martin Ponweiser",
    language    = "English",
    type        = "Diploma thesis",
    year        = "2012",
    url         = "http://epub.wu.ac.at/id/eprint/3558"
  }

Folders:

  • application-pnas: LaTeX/Sweave/R code for the main part of the thesis. Also included are the PNAS abstracts corpus and its metadata in tm format.
  • dirichlet: Sweave/R code to generate figures of the beta and Dirichlet distributions.
  • web-scraping: a Python/Scrapy/R project to web-scrape a corpus of PNAS journal abstracts.
