Skip to content
This repository was archived by the owner on Mar 8, 2019. It is now read-only.

truas/Document_Parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Document_Parser

Small program to concatenate several different text files into one single file. Each line of this file is a single document Removes stop words, punctuation and other noise from text Can read any text file in subdirectory structure

Command line:

python3 doc_operations.py --input <input_folder_root> --ouput <output_folder_root>

[2018-03-13]

  1. 'files' folder for input/output should be placed in the same level as document_analysis

About

Small program to concatenate several different text files into one single file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages