AVAH - Accelerating Variant Calling on Human Genomes

This is a subproject of EVA.

Acknowledgments

This work is supported by the National Science Foundation under Grant No. 2034247.

Environments

Spark 3.0.0, Hadoop 3.2.0, Scala 2.12.8 (for Adam-Cannoli)
Spark 2.4.7, Hadoop 2.7.6, Scala 2.11.12 (for GATK4)

Hadoop 3+ lists the worker (DataNode) hosts in etc/hadoop/workers, which replaces the etc/hadoop/slaves file used by Hadoop 2. To verify that the data nodes are up, run hdfs dfsadmin -report.
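The check described above can be sketched as two small shell helpers. This is an illustrative sketch, not part of AVAH: the function names count_workers and live_datanodes are hypothetical, the default workers path assumes HADOOP_HOME is set, and the parser assumes the report contains a line of the form "Live datanodes (N):".

```shell
# Count the hosts listed in a Hadoop 3 workers file.
# Blank lines and full-line "#" comments are skipped.
count_workers() {
  # $1: path to the workers file (the default path is an assumption)
  grep -Ev '^[[:space:]]*(#|$)' "${1:-$HADOOP_HOME/etc/hadoop/workers}" | wc -l | tr -d ' '
}

# Read the text of "hdfs dfsadmin -report" on stdin and print the live DataNode count.
# Assumes the report contains a line of the form "Live datanodes (N):".
live_datanodes() {
  sed -n 's/^Live datanodes (\([0-9][0-9]*\)).*/\1/p'
}

# Usage on a cluster node (not executed here):
#   expected=$(count_workers)
#   live=$(hdfs dfsadmin -report | live_datanodes)
#   [ "$expected" = "$live" ] || echo "expected $expected DataNodes, found $live live"
```

If the two numbers differ, a host named in the workers file has probably not started its DataNode or has not registered with the NameNode.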

Setup and execution of variant analysis using AVAH

The instructions are here.

Rebuilding the JAR if needed

This is a Scala project. You can use sbt to compile and package the project. The JAR file should be copied manually to lib/ before executing AVAH.

If you change the scalaVersion in build.sbt, run reload in the sbt shell before rebuilding the JAR so that the new setting takes effect.
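A minimal sketch of the rebuild steps, assuming the plain sbt package task and sbt's default output directory target/scala-<binary version>/. The helper names scala_target_dir and rebuild_jar are hypothetical. If the project builds its JAR through an assembly plugin, the task name and output location would differ.

```shell
# Print sbt's default output directory for the scalaVersion declared in a build file.
# Assumes a line of the form: scalaVersion := "2.12.8"
scala_target_dir() {
  # $1: path to build.sbt (defaults to ./build.sbt)
  sed -n 's|.*scalaVersion[[:space:]]*:=[[:space:]]*"\([0-9]*\.[0-9]*\)\..*|target/scala-\1|p' \
    "${1:-build.sbt}" | head -n 1
}

# Package the project and copy the resulting JAR(s) into lib/.
# Run from the project root; requires sbt on the PATH.
rebuild_jar() {
  sbt clean package || return 1
  cp "$(scala_target_dir)"/*.jar lib/
}
```

Running sbt in batch mode like this reads build.sbt afresh on every invocation, so the reload step only matters for an sbt shell that is already open.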

Useful YARN commands

To check YARN jobs:

yarn application -list

To kill YARN jobs:

yarn application -kill <application_ID>
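The two commands above can be combined to kill every listed job at once. The sketch below is illustrative only: yarn_app_ids and kill_listed_apps are hypothetical helper names, and the parser assumes that each job row of the listing starts with an ID of the form application_....

```shell
# Read the text of "yarn application -list" on stdin and print one application ID per line.
# Assumes the ID is the first whitespace-separated field and starts with "application_".
yarn_app_ids() {
  awk '$1 ~ /^application_/ { print $1 }'
}

# Kill every application that is currently listed.
# Use with care on a shared cluster: this does not filter by user or queue.
kill_listed_apps() {
  yarn application -list 2>/dev/null | yarn_app_ids | while read -r app_id; do
    yarn application -kill "$app_id"
  done
}
```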

To see YARN queues:

mapred queue -list

To change YARN's scheduler configuration via the command line:

yarn schedulerconf

Examples:

yarn schedulerconf -global yarn.scheduler.maximum-allocation-mb=16384

yarn schedulerconf -global yarn.scheduler.maximum-allocation-vcores=32

yarn schedulerconf -global yarn.scheduler.maximum-allocation-mb=16384,yarn.scheduler.maximum-allocation-vcores=32

To check status

yarn queue -status default

To view YARN logs

yarn logs -applicationId <application_ID>

To view cluster usage

yarn top

yarn node -all -list

yarn node -showDetails -list

Monitoring process execution

dstat --cpu --mem --load --top-cpu --top-mem -dn --output report.csv 2 10

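The trailing 2 10 takes a sample every 2 seconds, 10 times, and --output writes the samples to report.csv. To suppress intermediate updates between samples, add --noupdate: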

dstat --cpu --mem --load --top-cpu --top-mem -dn --noupdate --output report.csv 2 10
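Once report.csv has been written, it can be summarized from the shell. The following is a sketch under assumptions: mean_cpu_idle is a hypothetical helper, and it assumes that the CSV has a quoted column-name row containing "idl" (the CPU idle column) followed by numeric data rows. The exact layout may vary between dstat versions.

```shell
# Print the mean CPU idle percentage recorded in a dstat CSV report.
# Assumes a quoted header row containing "idl", followed by numeric data rows.
mean_cpu_idle() {
  # $1: path to the CSV file (defaults to report.csv)
  awk -F, '
    col == 0 {                        # still searching for the column-name row
      for (i = 1; i <= NF; i++) {
        name = $i
        gsub(/"/, "", name)           # strip the CSV quotes
        if (name == "idl") { col = i; break }
      }
      next
    }
    $col ~ /^[0-9.]+$/ { sum += $col; n++ }    # numeric data rows only
    END { if (n > 0) printf "%.1f\n", sum / n }
  ' "${1:-report.csv}"
}

# Usage (not executed here):
#   mean_cpu_idle report.csv
```

A low mean idle value during a run suggests the node was CPU bound. The same pattern can be adapted to other columns by changing the column name.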
