-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Open
Description
First, thanks for the library. Second, I found a problem with the requirements after the release of nltk == 3.8.2 for WordTokenizer
The aforementioned nltk == 3.8.2 solves a remote code execution vulnerability, disabled to obtain "punkt" and replaced it with "punkt_tab".
Issues:
- Remote code execution vulnerability in NLTK nltk/nltk#3266 (comment)
- [BUG] punkt_tab breaking change nltk/nltk#3293
This creates a problem with newspaper3k when downloading "punkt" in REQUIERED_CORPORA. Then I suggest modify REQUIRED_CORPORA or updating the requierements.py to avoid using nltk >= 3.8.2
Metadata
Metadata
Assignees
Labels
No labels