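Chinese word segmentation is done with jieba, and jieba's keyword extraction API is used to speed up classification. Training data is persisted in redis. The project provides implementations of both a Bayesian classifier and a Fisher classifier; judging from the test results, their accuracy is quite good. manager.py provides a few simple methods for testing and training.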
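The following is a minimal sketch of the Bayesian approach, not the project's actual code. The class name `NaiveBayesSketch`, its methods, and the in-memory dictionaries are hypothetical stand-ins: the real project keeps its counts in redis and extracts features with jieba, whereas this sketch defaults to whitespace splitting so that it runs without extra dependencies.

```python
import math
from collections import defaultdict


class NaiveBayesSketch:
    """Hypothetical naive Bayes text classifier with in-memory counts."""

    def __init__(self, tokenize=None):
        # Assumption: whitespace splitting by default. For Chinese text, a
        # jieba-based callable could be passed instead, for example
        # lambda text: jieba.analyse.extract_tags(text, topK=20)
        self.tokenize = tokenize or (lambda text: text.split())
        # category -> feature -> number of documents containing the feature
        self.feature_counts = defaultdict(lambda: defaultdict(int))
        # category -> number of training documents
        self.category_counts = defaultdict(int)

    def train(self, text, category):
        """Record one labelled document."""
        self.category_counts[category] += 1
        for feature in set(self.tokenize(text)):
            self.feature_counts[category][feature] += 1

    def classify(self, text):
        """Return the most probable category, or None if untrained."""
        total_docs = sum(self.category_counts.values())
        features = set(self.tokenize(text))
        best, best_score = None, -math.inf
        for category, n_docs in self.category_counts.items():
            # Log prior plus log likelihood of each feature present in the
            # text, with add-one smoothing so unseen features do not zero
            # out the score.
            score = math.log(n_docs / total_docs)
            for feature in features:
                count = self.feature_counts[category].get(feature, 0)
                score += math.log((count + 1) / (n_docs + 2))
            if score > best_score:
                best, best_score = category, score
        return best


classifier = NaiveBayesSketch()
classifier.train("cheap pills buy now", "spam")
classifier.train("meeting schedule tomorrow", "ham")
predicted = classifier.classify("buy cheap pills")
```

Working in log space avoids floating-point underflow when many feature probabilities are multiplied together. Replacing the two dictionaries with redis hashes would make the trained counts survive restarts, which is the role redis plays in the project.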
MSchumi/Classifier