NextFlow_RD_Genomic

Description

A simple base pipeline for rare disease and germline genomics, used to test the effects of down-sampling on variant calling.

Basic Overview

This project uses the Nextflow workflow software to run the following pipeline.

Pipeline

Index genome > FastQC analysis > Align reads > Downsample BAM files > Sort BAM > Mark duplicates > Index BAM > Call variants > Hard filter
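Because the pipeline exists to test down-sampling, it helps to see how a down-sampling fraction relates to sequencing depth. The sketch below is illustrative only: `downsample_fraction` is a hypothetical helper, not part of this repository, and the depths in the usage note are made-up values. How the pipeline itself chooses its fractions is defined in its own modules and configuration.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository): fraction of reads to keep
# so that a BAM at mean depth $1 is reduced to roughly mean depth $2.
downsample_fraction() {
    local current=$1 target=$2
    awk -v c="$current" -v t="$target" 'BEGIN {
        # Reject non-positive depths and targets above the current depth.
        if (c <= 0 || t <= 0 || t > c) { exit 1 }
        printf "%.3f\n", t / c
    }'
}
```

For example, `downsample_fraction 40 10` prints `0.250`, meaning a quarter of the reads would be kept. A request that cannot be met by down-sampling, such as `downsample_fraction 10 40`, prints nothing and returns a non-zero status.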

Setup

To run the pipeline, we first need to obtain the following inputs.

A genome build (GRCh38), provided by the Broad Institute

$ cd data/genome
$ wget https://storage.googleapis.com/gcp-public-data--broad-references/hg38/v0/Homo_sapiens_assembly38.fasta

A FASTQ sample (for workflow development)

$ cd ../samples
$ wget https://ftp.sra.ebi.ac.uk/vol1/fastq/SRR151/003/SRR1518253/SRR1518253_1.fastq.gz && \
wget https://ftp.sra.ebi.ac.uk/vol1/fastq/SRR151/003/SRR1518253/SRR1518253_2.fastq.gz && \
gunzip *.gz
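Before running the pipeline, it can be worth confirming that both FASTQ files downloaded and decompressed completely. The sketch below is not part of this repository: `count_reads` and `check_pairs` are hypothetical helpers, and they assume uncompressed files with standard four-line FASTQ records.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository): count the reads in an
# uncompressed FASTQ file, assuming standard four-line records.
count_reads() {
    local lines
    lines=$(wc -l < "$1")
    echo $(( lines / 4 ))
}

# Compare the two mates of a paired-end sample; the read counts should match.
check_pairs() {
    local r1 r2
    r1=$(count_reads "$1")
    r2=$(count_reads "$2")
    if [ "$r1" -eq "$r2" ]; then
        echo "OK: $r1 read pairs"
    else
        echo "MISMATCH: $r1 vs $r2 reads" >&2
        return 1
    fi
}
```

From the samples directory, this could be called as `check_pairs SRR1518253_1.fastq SRR1518253_2.fastq`. A mismatch usually points to a truncated download rather than a problem with the pipeline.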

When scaling up, use the larger NA12878 WES FASTQ samples

$ wget https://genomics.viapath.co.uk/benchmark/files/FASTQ/NA12878_WES.zip

Running the pipeline

# Using Docker
$ nextflow run -profile docker main.nf

# Using Docker images in Singularity
$ nextflow run -profile singularity main.nf

Validating the pipeline

See https://genomics.viapath.co.uk/benchmark

DNAnexus applet setup (a local applet for basic testing)

DNAnexus Python Bindings Documentation

Install the app

$ pip install -r requirements.txt

Routine maintenance: periodically update dxpy

$ pip install --upgrade dxpy

DNAnexus Tutorial

$ dx select <your-project-name>
$ dx build --nextflow
