Small program to concatenate several text files into a single file. Each line of the output file is a single document. Removes stop words, punctuation, and other noise from the text. Can read any text file in a subdirectory structure.
python3 doc_operations.py --input <input_folder_root> --ouput <output_folder_root>
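
The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of doc_operations.py: the names `STOP_WORDS`, `clean_text`, and `concatenate_documents` are hypothetical, the stop-word list is a tiny placeholder, and treating only `.txt` files as text files is an assumption.

```python
import os
import string

# Hypothetical, deliberately tiny stop-word list; the real script may use a fuller one.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "it"}


def clean_text(text):
    """Lowercase the text, strip ASCII punctuation, and drop stop words."""
    no_punct = text.lower().translate(str.maketrans("", "", string.punctuation))
    # split() also collapses newlines, so each document ends up on one line.
    return " ".join(w for w in no_punct.split() if w not in STOP_WORDS)


def concatenate_documents(input_root, output_path):
    """Write one cleaned line per .txt file found under input_root.

    Returns the number of documents written. Assumes output_path is
    located outside input_root, so the output is not read back as input.
    """
    count = 0
    with open(output_path, "w", encoding="utf-8") as out:
        for dirpath, dirnames, filenames in os.walk(input_root):
            dirnames.sort()  # make the traversal order deterministic
            for name in sorted(filenames):
                if not name.endswith(".txt"):
                    continue
                path = os.path.join(dirpath, name)
                with open(path, encoding="utf-8", errors="ignore") as f:
                    line = clean_text(f.read())
                if line:  # skip documents that are empty after cleaning
                    out.write(line + "\n")
                    count += 1
    return count
```

Calling `concatenate_documents("<input_folder_root>", "<output_file>")` would then produce a file with one cleaned document per line, in sorted traversal order.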
[2018-03-13]
- The 'files' folder for input/output should be placed at the same level as document_analysis