Skip to content

VCF file generated from inference step is empty #9

@ypauling

Description

@ypauling

Hi deeppolisher team,
Thanks for developing this tool!
I tried to use this tool to polish degu genome built from PacBio hifi reads. After the inference step, the resulting vcf file is empty and the log file shows "Total window processed: 0". Below are some more details about how I used the tool.

The initial input files I have are two haplotype specific genomes (hap1 and hap2) and multiple hifi read files (fasta). Since we do not have ONT UL reads to run PHAROAPH pipeline, I combined the two haplotype genomes into a diploid genome and aligned the hifi reads to the diploid genome using minimap2 with default parameters. I combined bam files from multiple hifi read files and sorted and used the final merged bam file as the input for the deeppolisher tool.

I followed the case study to first generate the image directory, using the diploid genome and the merged sorted bam file as inputs. The output directory contains multiple *tfrecords.gz files which are ~1.5G each. I then used this directory and the checkpoint file provided in the docker image /opt/models/pacbio_model/checkpoint to run the inference step. The inference step finished in a few seconds, which might suggest that it cannot find any proper "window" in the provided diploid genome.

Do you know what might be the potential reasons for why the vcf file is empty? I can provide more details if that's helpful.

Thanks!
Bing

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions