HyperDunk/tika-bot

tika-bot

Command-line arguments (all are required; none are optional):

Path to folder containing input TSV files: -i "path".

Path to output folder where XHTML files should be written: -xo "path".

Path to output folder where JSON files should be written: -jo "path".

Deduplication on or off: -d value. Value should be 1 for on and 0 for off.
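
The four arguments above could be validated along the following lines. This is only a minimal sketch, not the project's actual parsing code: the class name `TikaBotArgs` and the methods `parse` and `dedupEnabled` are hypothetical, and only the flags `-i`, `-xo`, `-jo`, and `-d` come from this README.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper; the real tika-bot entry point may parse arguments differently.
public class TikaBotArgs {
    // The four flags documented in this README; all are required.
    private static final List<String> REQUIRED = Arrays.asList("-i", "-xo", "-jo", "-d");

    // Reads "flag value" pairs and rejects unknown, missing, or malformed arguments.
    static Map<String, String> parse(String[] args) {
        Map<String, String> opts = new HashMap<>();
        for (int i = 0; i < args.length; i += 2) {
            String flag = args[i];
            if (!REQUIRED.contains(flag)) {
                throw new IllegalArgumentException("Unknown argument: " + flag);
            }
            if (i + 1 >= args.length) {
                throw new IllegalArgumentException("Missing value for " + flag);
            }
            opts.put(flag, args[i + 1]);
        }
        for (String flag : REQUIRED) {
            if (!opts.containsKey(flag)) {
                throw new IllegalArgumentException("Missing required argument: " + flag);
            }
        }
        String dedup = opts.get("-d");
        if (!dedup.equals("0") && !dedup.equals("1")) {
            throw new IllegalArgumentException("-d must be 1 (on) or 0 (off), got: " + dedup);
        }
        return opts;
    }

    // Deduplication is on when -d is 1 and off when -d is 0.
    static boolean dedupEnabled(Map<String, String> opts) {
        return "1".equals(opts.get("-d"));
    }

    public static void main(String[] args) {
        try {
            Map<String, String> opts = parse(args);
            System.out.println("Input folder:  " + opts.get("-i"));
            System.out.println("XHTML output:  " + opts.get("-xo"));
            System.out.println("JSON output:   " + opts.get("-jo"));
            System.out.println("Deduplication: " + (dedupEnabled(opts) ? "on" : "off"));
        } catch (IllegalArgumentException e) {
            System.err.println(e.getMessage());
            System.err.println("Usage: -i \"path\" -xo \"path\" -jo \"path\" -d 0|1");
        }
    }
}
```

Failing fast on a missing flag fits the "all are required" rule: the program stops with a usage message instead of running with a partly specified configuration.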

Note: Make sure the tika-app-1.6.jar library is linked to your project, i.e. that it is on the classpath.
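
One way to confirm the jar is visible at runtime is to try loading a Tika class before doing any work. The sketch below is an assumption about how such a check might look and is not part of tika-bot: the class `TikaClasspathCheck` and the method `isOnClasspath` are hypothetical names, while `org.apache.tika.Tika` is the facade class shipped with Apache Tika.

```java
// Hypothetical startup check; not taken from the tika-bot sources.
public class TikaClasspathCheck {
    // Returns true if the named class can be located by this class's class loader.
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers.
            Class.forName(className, false, TikaClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        if (isOnClasspath("org.apache.tika.Tika")) {
            System.out.println("Tika found on the classpath.");
        } else {
            System.out.println("Tika not found; add tika-app-1.6.jar to the classpath.");
        }
    }
}
```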
