Skip to content

Automated Reporting for Systems Biology Modeling Workflows #281

@matthiaskoenig

Description

@matthiaskoenig

Automated Reporting for Systems Biology Modeling Workflows

Background

Computational systems biology relies heavily on standardized modeling and simulation workflows, most notably SBML (Systems Biology Markup Language) for models, SED-ML (Simulation Experiment Description Markup Language) for simulation setups or PEtab for parameter optimization workflows. While robust toolchains exist for parameter estimation and sensitivity analysis, the communication of results remains a major bottleneck.

Currently, results are often scattered across scripts, log files, plots, and tables, making it difficult to ensure reproducibility, transparency, and accessibility for collaborators and the wider community. There is a clear need for automated, reproducible reporting pipelines that transform modeling workflows into human-readable, well-structured reports.

This project builds on and contributes directly to sbmlsim, an open-source Python framework for simulation, parameter optimization, and sensitivity analysis of SBML models:

Several existing model repositories already use CI/CD workflows to execute simulations, but reporting is mostly manual or ad hoc. This project addresses this gap by integrating standardized report generation into modeling workflows.

Goal

The goal of this GSoC project is to design and implement an automated reporting framework for systems biology modeling workflows based on SBML, SED-ML, and PETab integrated into sbmlsim.

The project will:

  • Generate interactive web reports using Quarto (interactive figures and tables).
  • Generate static PDF reports using Typst (publication-ready summaries and figures).
  • Integrate report generation into CI/CD pipelines (GitHub Actions).
  • Automatically publish web reports via GitHub Pages (github.io).

The final outcome will be reusable reporting modules and templates within sbmlsim that can be adopted across multiple systems biology and systems medicine projects.

Difficulty Level: Medium

This project is rated Medium because it combines Python development, scientific workflows, CI/CD automation, and report generation. While advanced mathematical modeling is not required, familiarity with reproducible computational workflows is important.

Size and Length of Project

  • medium: 175 hours
  • 12 weeks

Skills

Essential skills

  • Python
  • Markdown
  • Git / GitHub

Nice to have skills

  • Quarto
  • Typst
  • CI/CD (GitHub Actions)
  • Systems biology or scientific computing background

Public Repository

Primary development repository:

Contributing to sbmlsim on develop branch:

Example model repositories using CI/CD workflows:

All code will be open source and publicly available throughout and after GSoC.

Potential Mentors

AI Usage Policy

AI tools (e.g. large language models for code suggestions, refactoring, or drafting documentation) are permitted as assistive tools, under the following conditions:

  • The contributor must fully understand, review, and validate all AI-assisted outputs.
  • The student remains fully responsible for correctness, licensing compliance, and scientific validity.
  • AI tools must not be used to fabricate data, results, or evaluations.
  • Substantial AI assistance should be transparently documented in commit messages or pull requests.

The project emphasizes reproducibility, transparency, and responsible use of AI as a supporting instrument—not a substitute for scientific reasoning or software engineering.

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions