Skip to content

Crash on large metatranscriptomics database #49

@fdelogu

Description

@fdelogu

Hi,

I am trying to run mmseq-linux v1.0.9 using a large metatranscriptomics dataset as reference (>4 million ORFs). The run stops after saying “Amalgamating transcripts and calculating summary statistics...”, due to memory limits (SIGSEGV: 11) when I used the maximum RAM available to me (3TB).
Before the crash the following output files are produced along the way:
sample.k,
sample.M,
sample.gene.trace_gibbs.gz,
sample.identical.trace_gibbs.gz,
sample.prop.trace_gibbs.gz,
sample.trace_gibbs.gz.

I’ve run some tests on a E.coli dataset and observed that the memory consumption jumps during the “Amalgamation of transcripts and calculation of summary statistics” requiring several times the one used during the rest of the computation. Is there a way to circumvent this problem and complete the run?

Best regards,
Francesco

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions