yunfeim2/CourseProject

CourseProject

This project reproduces the results of latent aspect analysis.

The source code comes from the original paper and was provided together with the data.

Software installation

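First, download the NLTK data by running the following in a Python interpreter (the nltk package itself must already be installed, for example with pip install nltk):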

import nltk
nltk.download()

The project is based on a mix of Java and Python.

Run the project

First, generate the keywords for the latent aspect analysis by running:

python key_generator.py

The file stopwords.txt lists the stop words, which are likely to occur in all kinds of documents. We therefore ignore those words to produce a better keyword collection.
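The stop-word filtering idea can be sketched roughly as follows. This is a minimal illustration and not the actual logic of key_generator.py. The helper names load_stopwords and keyword_counts, the one-word-per-line file format, and the sample review text are all assumptions made for the example.

```python
from collections import Counter
import re


def load_stopwords(lines):
    """Build a stop-word set from an iterable of lines (assumed: one word per line)."""
    return {line.strip().lower() for line in lines if line.strip()}


def keyword_counts(text, stopwords):
    """Count lowercase alphabetic tokens in text, skipping stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in stopwords)


# Hypothetical stop-word list and review text, for illustration only.
stopwords = load_stopwords(["the", "was", "and", "a", ""])
counts = keyword_counts("The room was clean and the staff was friendly.", stopwords)
print(counts.most_common(3))
```

If stopwords.txt does hold one word per line, an open file handle could be passed straight to load_stopwords, since it only needs an iterable of lines.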

After that, download Apache OpenNLP from https://opennlp.apache.org/, install it, and set the environment variable to point to the folder where you installed it.

Next, use Java to run analyse.java under src. The final result for the hotel data is written to vector_CHI_4000.dat in the vectors folder.

Presentation video: https://mediaspace.illinois.edu/media/1_64gzbvp4

Sources
