Skip to content

cdBG output format description #9

@dkoslicki

Description

@dkoslicki

After running,

the first few lines are:

------------------------------------------------------------------------------------------------
---------------         columns: unitig id, next unitig ids, unitig content     ----------------
------------------------------------------------------------------------------------------------
1  644039  708271    AAAATTTTTTTTTTTTTTTT
2  1  51647    AAAAATTTTTTTTTTTTTTT
3  52749  53385    ATTAAAAAAATTTTTGTTTT
4  2  51649    AAAAAATTTTTTTTTTTTTT
5  629647  697882    TAATCACGACCCGTTTTATTTT

Two issues with this:

  1. It will be difficult to parse if the output file has an unconventionally formatted header. Consider describing the output format elsewhere and just print the columns. Also, be sure to choose a standard output format like TSV, CSV, or something else easily imported into python, R, and/or worked with on the Linux command line.
  2. There are 4 columns listed, but the header only describes three. Eg. what is the 708271 in the first content row?

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingdocumentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions