jacobvp2/CourseProject

 
 

CS410 Course Project Team GOAT

Background

Reproducing a Paper. For our final project, we attempted to reproduce the results of the contextual text mining research paper listed below:

ChengXiang Zhai, Atulya Velivelli, and Bei Yu. 2004. A cross-collection mixture model for comparative text mining. In Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining (KDD 2004). ACM, New York, NY, USA, 743-748. DOI=10.1145/1014052.1014150

The actual paper has been included in our repository for your own reference.
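For orientation, here is a minimal sketch of the word probability that the cross-collection mixture model assigns, as we read the formulation in the paper. It is not taken from mixture.py. The function name, argument names, and toy values are all hypothetical and chosen only for illustration. Each word is generated either by a background model (with weight lambda_b) or by one of k themes, and each theme blends a distribution shared across collections (with weight lambda_c) with a collection-specific one.

```python
import numpy as np


def word_probability(p_background, p_common, p_specific, pi_d, lambda_b, lambda_c):
    """Per-word mixture probability for one document in one collection.

    p_background: shape (V,), background word distribution.
    p_common:     shape (k, V), theme distributions shared by all collections.
    p_specific:   shape (k, V), theme distributions for this document's collection.
    pi_d:         shape (k,), theme weights for the document (sums to 1).
    lambda_b:     weight of the background model.
    lambda_c:     weight of the common part of each theme.
    """
    # Blend the common and collection-specific view of every theme.
    theme_mix = lambda_c * p_common + (1.0 - lambda_c) * p_specific
    # Combine the themes with the document's weights, then add the background.
    return lambda_b * p_background + (1.0 - lambda_b) * (pi_d @ theme_mix)


# Toy values (hypothetical): a 5-word vocabulary and 2 themes.
rng = np.random.default_rng(0)
V, k = 5, 2
p_background = rng.dirichlet(np.ones(V))
p_common = rng.dirichlet(np.ones(V), size=k)
p_specific = rng.dirichlet(np.ones(V), size=k)
pi_d = np.array([0.7, 0.3])

probs = word_probability(p_background, p_common, p_specific, pi_d,
                         lambda_b=0.9, lambda_c=0.5)
print(probs)
```

Because every component is itself a probability distribution and the weights are convex, the result is again a distribution over the vocabulary. In the paper these distributions are estimated with EM rather than fixed by hand as they are here.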

Setup/Dependencies

This repo assumes that the user has Python 3 and pip installed. If not, they can be found here.

Our project has two package dependencies, newsapi and numpy. Running "pip install -r requirements.txt" should install the correct versions of both packages.

Next, cd into the CourseProject directory, run mixture.py, and follow the prompts to run our implementation of the cross-collection mixture model for comparative text mining.

Data

Our data is not a fixed dataset: it depends on the terms you enter at the prompts. A demonstration video is available on YouTube here. To keep the data as similar as possible to that of the original paper, please use the terms "Iraq" and "Afghanistan" when prompted.
