Skip to content

This is a search engine designed to search for blogs on CSDN.net. The user interface is a WeChat Mini Porgram. The project includes crawling, text mining, inverted index, VSM, tf-idf, page rank and other methods. Due to private reason, this is a part of the whole project.

Notifications You must be signed in to change notification settings

steventan1011/SearchEngine-CSDN

Repository files navigation

SearchEngine-CSDN

This is a search engine designed to search for blogs on CSDN.net. The user interface is a WeChat Mini Porgram. The project includes crawling, text mining, inverted index, VSM, tf-idf, page rank and other methods.

CSDN-based Search Engine Design

[Python, C Language, user design, algorithm design, text analysis]

  1. Crawled data from the CSDN.net, processed the data with word segmentation, empty words filtering and stop words removing, generated the inverted index

  2. Used TF-IDF, VSM Weighting and Page Rank to complete the searching algorithm in Python

  3. Successfully completed the front end as a WeChat mini program with high retrieval precision and speed

About

This is a search engine designed to search for blogs on CSDN.net. The user interface is a WeChat Mini Porgram. The project includes crawling, text mining, inverted index, VSM, tf-idf, page rank and other methods. Due to private reason, this is a part of the whole project.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published