Arguments to execute (all are must, no optional):
Path to folder containing input TSV files: -i "path".
Path to output folder where xhtml files should be written: -xo "path".
Path to output folder where json files should be written: -jo "path".
Deduplication on or off: -d value. Value should be 1 for on and 0 for off.
Note: Make sure you have linked tika-app-1.6.jar library to your project / is part of the classpath.