NamithaGS/HMM_POSTagger

# Hidden Markov Model Part-of-Speech Tagger

This is a Hidden Markov Model part-of-speech tagger for Catalan. The training data is tokenized and tagged. The test data is tokenized only; the program adds the tags.

`python hmmlearn.py /path/to/input`

The argument is a single file containing the training data. The program learns a hidden Markov model and writes the model parameters to a file called hmmmodel.txt.
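
To illustrate what "learning" the model can involve, here is a minimal sketch of count-based estimation of transition and emission probabilities. It is not the code in hmmlearn.py. It assumes each line of the training file is one sentence of space-separated `word/TAG` tokens, which this README does not specify, and the function name `learn_hmm`, the start state `<s>`, and the example tags are all hypothetical.

```python
from collections import defaultdict


def learn_hmm(lines):
    """Estimate transition and emission probabilities by relative frequency.

    Assumption: each line is one sentence of space-separated word/TAG tokens.
    """
    transitions = defaultdict(lambda: defaultdict(int))  # previous tag -> tag -> count
    emissions = defaultdict(lambda: defaultdict(int))    # tag -> word -> count

    for line in lines:
        prev = "<s>"  # hypothetical start-of-sentence state
        for token in line.split():
            # Split on the last slash, since a word may itself contain "/".
            word, tag = token.rsplit("/", 1)
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            prev = tag

    def normalize(table):
        # Turn each row of counts into a probability distribution.
        return {
            given: {key: count / sum(row.values()) for key, count in row.items()}
            for given, row in table.items()
        }

    return normalize(transitions), normalize(emissions)


# Toy data with made-up tags, for illustration only.
trans, emit = learn_hmm(["el/DA gat/NC dorm/VM", "el/DA gos/NC"])
```

This sketch applies no smoothing, so any tag pair or word not seen in training gets no probability at all; a practical tagger needs some strategy for unseen events.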

`python hmmdecode.py /path/to/input`

The argument is a single file containing the test data. The program reads the parameters of a hidden Markov model from the file hmmmodel.txt, tags each word in the test data, and writes the results to a text file called hmmoutput.txt in the same format as the training data.
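
Tagging with a hidden Markov model is commonly done with the Viterbi algorithm, and the sketch below shows that approach for a first-order model. This README does not state which decoding method hmmdecode.py uses, so treat this as an illustration rather than the repository's implementation. The function name `viterbi`, the start state `<s>`, the `floor` probability for unseen events, and the toy model are all assumptions.

```python
import math


def viterbi(words, tags, trans, emit, floor=1e-10):
    """Return the most probable tag sequence for a non-empty list of words.

    `floor` is a hypothetical stand-in probability for unseen transitions
    and emissions; it is not a proper smoothing scheme.
    """
    def lp(table, given, key):
        # Log-probability lookup, falling back to the floor value.
        return math.log(table.get(given, {}).get(key, floor))

    # best[i][t]: log-probability of the best path ending in tag t at position i.
    best = [{t: lp(trans, "<s>", t) + lp(emit, t, words[0]) for t in tags}]
    back = []  # back[i][t]: best previous tag for tag t at position i + 1

    for word in words[1:]:
        scores, pointers = {}, {}
        for t in tags:
            prev = max(tags, key=lambda p: best[-1][p] + lp(trans, p, t))
            scores[t] = best[-1][prev] + lp(trans, prev, t) + lp(emit, t, word)
            pointers[t] = prev
        best.append(scores)
        back.append(pointers)

    # Follow the back-pointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t])
    path = [tag]
    for pointers in reversed(back):
        tag = pointers[tag]
        path.append(tag)
    return list(reversed(path))


# Toy model with made-up tags and probabilities, for illustration only.
toy_trans = {"<s>": {"DA": 1.0}, "DA": {"NC": 1.0}, "NC": {"VM": 1.0}}
toy_emit = {"DA": {"el": 1.0}, "NC": {"gat": 0.5, "gos": 0.5}, "VM": {"dorm": 1.0}}
toy_tags = ["DA", "NC", "VM"]
result = viterbi(["el", "gat", "dorm"], toy_tags, toy_trans, toy_emit)
```

Working in log space avoids numeric underflow on long sentences, since the probability of a path is a product of many small numbers.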
