Skip to content

Failed to convert gnomAD v4.1 VCFs to CRISPRme-required format #82

@schuroff

Description

@schuroff

Describe the bug
Failed to convert gnomAD v4.1 VCFs to CRISPRme-required format.

To Reproduce

We installed CRISPRme v2.1.7 using mamba.
We also tried with conda, but the same issue occurred.
The installation completed successfully and:
crisprme.py --version #worked as expected.

We are using gnomAD v4.1 joint VCFs.

When running the gnomAD-converter command as described in the documentation, we got the following error:

Command-line:
~/crisprme$ crisprme.py gnomAD-converter --gnomAD_VCFdir ~/crisprme/VCFs/hg38_gnomad.v41/ --samplesID ~/crisprme/sampleIDs/samplesIDs.gnomad.v41.txt --joint --thread 20 --multiallelic --debug
DEBUG MODE
python: can't open file '/home/tiaf/crisprme/PostProcess/convert_gnomAD_vcfs.py': [Errno 2] No such file or directory
Traceback (most recent call last):
File "/home/tiaf/miniconda3/envs/crisprme/bin/crisprme.py", line 1319, in
gnomAD_converter()
File "/home/tiaf/miniconda3/envs/crisprme/bin/crisprme.py", line 1101, in gnomAD_converter
raise subprocess.SubprocessError(
subprocess.SubprocessError: An error occurred while converting gnomAD VCFs in /home/tiaf/crisprme/VCFs/hg38_gnomad.v41

After checking, we found the convert_gnomAD_vcfs.py file in the following locations:
/home/tiaf/miniconda3/envs/crisprme/opt/crisprme/PostProcess/convert_gnomAD_vcfs.py
/home/tiaf/miniconda3/pkgs/crisprme-2.1.7-hdfd78af_0/opt/crisprme/PostProcess/convert_gnomAD_vcfs.py
/home/tiaf/crisprme/CRISPRme/src/convert_gnomAD_vcfs.py
/home/tiaf/PostProcess/convert_gnomAD_vcfs.py

Then, we copied the folder /home/tiaf/miniconda3/pkgs/crisprme-2.1.7-hdfd78af_0/opt/crisprme/PostProcess/ to /home/tiaf/crisprme/
and re-ran the command. This time, we got a different error:

~/crisprme$ crisprme.py gnomAD-converter --gnomAD_VCFdir ~/crisprme/VCFs/hg38_gnomad.v41/ --samplesID ~/crisprme/sampleIDs/samplesIDs.gnomad.v41.txt --joint --thread 20 --multiallelic --debug
DEBUG MODE
Traceback (most recent call last):
File "/home/tiaf/crisprme/PostProcess/convert_gnomAD_vcfs.py", line 27, in
from utils import remove
ModuleNotFoundError: No module named 'utils'
Traceback (most recent call last):
File "/home/tiaf/miniconda3/envs/crisprme/bin/crisprme.py", line 1319, in
gnomAD_converter()
File "/home/tiaf/miniconda3/envs/crisprme/bin/crisprme.py", line 1101, in gnomAD_converter
raise subprocess.SubprocessError(
subprocess.SubprocessError: An error occurred while converting gnomAD VCFs in /home/tiaf/crisprme/VCFs/hg38_gnomad.v41

We highlight the following error: ModuleNotFoundError: No module named 'utils'. What is it?
In addition, we tried to run the complete search below:

crisprme.py complete-search --genome Genomes/hg38 --vcf vcf_gnomad_config22.txt --guide HBG-175.txt --pam PAMs/20bp-NG-SpCas9.txt --samplesID ~/crisprme/sampleIDs/samplesIDs.gnomad.v41.txt --be-window 4,8 --be-base A,G --mm 3 --bDNA 1 --bRNA 1 --merge 3 --output chr22_search --thread 20

and got the following errors:
mv: cannot stat '/home/tiaf/crisprme/Genomes/variants_genome/SNPs_genome/hg38_enriched/': No such file or di
rectory
mv: cannot stat '/home/tiaf/crisprme/Genomes/variants_genome/SNPs_genome/.json': No such file or directory
mv: cannot stat '/home/tiaf/crisprme/Genomes/variants_genome/SNPs_genome/log
.txt': No such file or director
y
Traceback (most recent call last):
File "/home/tiaf/miniconda3/envs/crisprme/opt/crisprme/PostProcess/./pool_index_indels.py", line 25, in
for f in os.listdir(indels_folder):
FileNotFoundError: [Errno 2] No such file or directory: '/home/tiaf/crisprme/Genomes/variants_genome/'
ERROR: indels indexing failed on hg38_gnomad.v41

We believe this error is due to the failed VCF conversion.

Additional test

We also tested a smaller dataset (chr22) using:

~/crisprme$ crisprme.py complete-test --chrom chr22 --vcf_dataset 1000G
Downloading fasta file for chromosome(s) chr22
Downloading VCF data for chromsome(s) chr22
Creating VCF config file for dataset(s) 1000G
Downloading sample ids for dataset(s) 1000G
Creating samples config file for dataset(s) 1000G
Downloading ENCODE and GENCODE annotation data
Creating PAM file
Creating guide file
python: can't open file '/home/tiaf/crisprme/crisprme.py': [Errno 2] No such file or directory

Thank you for your help!
Best,
Paulo Schuroff

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions