CourseProject

This project reproduces the results of latent aspect analysis.

The source code comes from the paper and was provided together with the data.

Software installation

First, with NLTK installed, download the NLTK data by running the following in Python:

import nltk
nltk.download()

The project is based on a mix of Java and Python.

Run the project

First, generate the keywords for the latent aspect analysis by running:

python key_generator.py

The file stopwords.txt lists the stop words, which are likely to occur in all kinds of documents. We ignore those words in order to produce a better keyword collection.
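To illustrate the idea of ignoring stop words, here is a minimal sketch. It is not the code in key_generator.py: the function names `load_stopwords` and `count_keywords` are hypothetical, and it assumes that stopwords.txt holds one word per line.

```python
from collections import Counter
import re


def load_stopwords(lines):
    """Build a stop-word set from an iterable of lines (assumed: one word per line)."""
    return {line.strip().lower() for line in lines if line.strip()}


def count_keywords(text, stopwords):
    """Count lowercase word tokens in text, skipping any token in stopwords."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(t for t in tokens if t not in stopwords)


# A small in-memory stop-word list stands in for stopwords.txt here.
stopwords = load_stopwords(["the", "is", "and", "was"])
counts = count_keywords("The room was clean and the staff is friendly", stopwords)
print(counts.most_common())
```

An open file can be passed in place of the list, for example `load_stopwords(open("stopwords.txt"))`, because the function only iterates over lines.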

After that, download Apache OpenNLP from https://opennlp.apache.org/, install it, and set the environment variable path to the folder where you installed it.
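One way to check that the environment is set up is sketched below. The launcher name `opennlp` and the variable name `OPENNLP_HOME` are assumptions made for illustration; this README does not say which variable the project reads, so adjust both to match your installation.

```python
import os
import shutil


def find_tool(name, env_var=None):
    """Return the path to an executable, or None if it cannot be found.

    Searches PATH first. If env_var is given and set, also searches
    the bin subfolder of the directory it points to.
    """
    found = shutil.which(name)
    if found is None and env_var and os.environ.get(env_var):
        bin_dir = os.path.join(os.environ[env_var], "bin")
        found = shutil.which(name, path=bin_dir)
    return found


# "opennlp" and "OPENNLP_HOME" are assumed names, not taken from this project.
location = find_tool("opennlp", env_var="OPENNLP_HOME")
print(location or "OpenNLP launcher not found; check your environment variables")
```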

Next, use Java to run analyse.java under src. The final result for the hotel data will be written to vector_CHI_4000.dat in the vectors folder.

Presentation video: https://mediaspace.illinois.edu/media/1_64gzbvp4
