Genomic Annotation of Metagenomic Sequences
- Pre-process input reads with cutadapt and report with fastqc
- (Optionally) Merge overlapping reads with flash
- Analyse pre-processed reads with centrifuge
-
For running the pipeline you first need to have a working conda installation. Most of the calls require the activation of the relevant conda environment, so you will have to create them beforehand. The
requirements.ymlfiles can be found in theenvsdirectory. -
Some biopet tools. Latest versions recommended:
-
Since this is wdl you will also need cromwell for the execution of the (sub)-workflows, which you can find here.
Locally:
$ java -jar /path/to/cromwell.<version>.jar run pipeline.wdl -i inputs.json
On a cluster:
$ java -Dconfig.file=/path/to/cluster.config \
-jar /path/to/cromwell.<version>.jar \
run pipeline.wdl -i inputs.json
For shark users:
$ module load cromwell/31
$ java -Dconfig.file=$CROMWELL_SHARK_CONFIG -jar $CROMWELL_JAR run pipeline.wdl -i INPUTS.json