FASTA parser issue #64

@cgps-admin

One of our users has triggered a bug in Kleborate's FASTA parsing. When the headers take the following format, Kleborate fails fairly quickly (I'm not sure of the exact error, as it is swallowed by our wrapper):

>genome_id #1
ATATAT...
>genome_id #2
ATATATTT...
>genome_id #3
CGTACG...

Presumably, the unique part of the header is discarded during parsing and only the part before the space is used to identify the contigs, so every contig ends up with the same ID. Running `sed -i 's/ /_/g'` on the files was enough to "fix" them and get them running.
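
To illustrate the suspected cause, here is a minimal Python sketch. It is not Kleborate's actual parser: `first_token_ids` and `duplicated_ids` are hypothetical helpers, and the assumption is that the contig ID is taken as the header text up to the first whitespace. The sequences are shortened placeholders.

```python
from collections import Counter


def first_token_ids(fasta_text):
    """Return one ID per record: the header text up to the first whitespace.

    Assumption: this mimics how the contig ID might be derived; it is not
    taken from Kleborate's source.
    """
    ids = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            fields = line[1:].split()
            ids.append(fields[0] if fields else "")
    return ids


def duplicated_ids(ids):
    """Return the sorted list of IDs that occur more than once."""
    counts = Counter(ids)
    return sorted(name for name, n in counts.items() if n > 1)


example = """>genome_id #1
ATATAT
>genome_id #2
ATATATTT
>genome_id #3
CGTACG
"""

ids = first_token_ids(example)
print(ids)                  # ['genome_id', 'genome_id', 'genome_id']
print(duplicated_ids(ids))  # ['genome_id']

# Same effect as the sed workaround: spaces become underscores.
fixed = example.replace(" ", "_")
print(duplicated_ids(first_token_ids(fixed)))  # []
```

If this is indeed what happens, a check like `duplicated_ids` run before the analysis would let the tool report the colliding headers instead of failing.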
