The aim of this project is to build a retrieval engine by using the concept of tf-idf and Latent Semantic Indexing. Tfidf is the short for term frequency inverse document frequency. It is used to evaluate how important a term is to a document in a collection. Latent Semantic Indexing use SVD to retrieve query to "concept".
#Lemur
The program is based on Lumer. Also, the SVD and LSI model is used.