-
Notifications
You must be signed in to change notification settings - Fork 7
Description
Hi deeppolisher team,
Thanks for developing this tool!
I tried to use this tool to polish degu genome built from PacBio hifi reads. After the inference step, the resulting vcf file is empty and the log file shows "Total window processed: 0". Below are some more details about how I used the tool.
The initial input files I have are two haplotype specific genomes (hap1 and hap2) and multiple hifi read files (fasta). Since we do not have ONT UL reads to run PHAROAPH pipeline, I combined the two haplotype genomes into a diploid genome and aligned the hifi reads to the diploid genome using minimap2 with default parameters. I combined bam files from multiple hifi read files and sorted and used the final merged bam file as the input for the deeppolisher tool.
I followed the case study to first generate the image directory, using the diploid genome and the merged sorted bam file as inputs. The output directory contains multiple *tfrecords.gz files which are ~1.5G each. I then used this directory and the checkpoint file provided in the docker image /opt/models/pacbio_model/checkpoint to run the inference step. The inference step finished in a few seconds, which might suggest that it cannot find any proper "window" in the provided diploid genome.
Do you know what might be the potential reasons for why the vcf file is empty? I can provide more details if that's helpful.
Thanks!
Bing