Tutorial for DNAnexus Platform (CLI)

All test samples and genome data are shared on our public DNAnexus project. You don't have to download any data for testing our pipeline on DNAnexus platform.

There are two methods to run our pipeline on DNAnexus.

Building your own DX workflow from chip.wdl with dxWDL (CLI)
Using a pre-built DX workflow on our public DX project (Web UI)

This document describes instruction for the item 1).

Sign up for a DNAnexus account.
Create a new DX project with name [YOUR_PROJECT_NAME] by clicking on "+New Project" on the top left.

Download dxWDL.

$ cd
$ wget https://github.com/dnanexus/dxWDL/releases/download/v1.46.4/dxWDL-v1.46.4.jar
$ chmod +rx dxWDL-v1.46.4.jar

Git clone this pipeline.

$ cd
$ git clone https://github.com/ENCODE-DCC/chip-seq-pipeline2

Move to pipeline's directory.
```
$ cd chip-seq-pipeline2
```

Choose an appropriate input for your project (AWS or Azure):

AWS

$ INPUT=example_input_json/dx/ENCSR936XTK_subsampled_chr19_only_dx.json

Azure

$ INPUT=example_input_json/dx_azure/ENCSR936XTK_subsampled_chr19_only_dx_azure.json

Make a WDL for DNAnexus use only. The original WDL will not work with inputs (e.g. BAMs, TAs) other than FASTQs. Then compile chip.dx.wdl with an input JSON for the SUBSAMPLED paired-end sample of ENCSR936XTK.

$ cp chip.wdl chip.dx.wdl
$ sed -i 's/Array\[File?\] bams = \[\]/Array\[File\] bams = \[\]/g' chip.dx.wdl
$ sed -i 's/Array\[File?\] nodup_bams = \[\]/Array\[File\] nodup_bams = \[\]/g' chip.dx.wdl
$ sed -i 's/Array\[File?\] tas = \[\]/Array\[File\] tas = \[\]/g' chip.dx.wdl
$ sed -i 's/Array\[File?\] ctl_bams = \[\]/Array\[File\] ctl_bams = \[\]/g' chip.dx.wdl
$ sed -i 's/Array\[File?\] ctl_nodup_bams = \[\]/Array\[File\] ctl_nodup_bams = \[\]/g' chip.dx.wdl
$ sed -i 's/Array\[File?\] ctl_tas = \[\]/Array\[File\] ctl_tas = \[\]/g' chip.dx.wdl

$ WDL=chip.dx.wdl
$ DXWDL=dxWDL-v1.46.4.jar
$ PROJECT=[YOUR_PROJECT_NAME]
$ OUT_FOLDER=/test_sample_chip_ENCSR936XTK_subsampled_chr19_only
$ DOCKER=$(cat ${WDL} | grep caper_docker | awk 'BEGIN{FS="'\''"} {print $2}')

$ java -jar ${DXWDL} compile ${WDL} -project ${PROJECT} -f -folder ${OUT_FOLDER} -defaults ${INPUT} -extras <(echo "{\"default_runtime_attributes\":{\"docker\":\"${DOCKER}\"}}")

Go to DNAnexus project page and click on your project.
Move to the directory /test_sample_chip_ENCSR936XTK_subsampled_chr19_only.
You will find a DX workflow chip with all parameters pre-defined. Click on it.
Specify an output directory by clicking "Workflow Actions" on the top right. Click on "Set output folder" and choose an output folder.
Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.
It will take about 6 hours. You will be able to find all outputs on your output folder. Final QC report (qc.html)/JSON (qc.json) will be found on it.
See full specification for input JSON file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tutorial_dx_cli.md

tutorial_dx_cli.md

Tutorial for DNAnexus Platform (CLI)

Files

tutorial_dx_cli.md

Latest commit

History

tutorial_dx_cli.md

File metadata and controls

Tutorial for DNAnexus Platform (CLI)