Skip to content

[EPIC] Creating a reusable QA workflow script #1

@jarmoza

Description

@jarmoza

Below are the tasks for a first stab at creating a generalized QA script for the PnP book processing and ingestion pipeline.

QA Script Pipeline Tasks

Master Script

  • One Python script should be set up with arguments to run the various possible subprocesses of this QA process and output metadata on each QA run (time run, slurm log list for each sbatch call, etc.)
  • An optional config file (i.e. yaml file) that includes directories and optionally a sequence of the QA subprocesses to run (i.e. clear old results, run new QA, gather results, analyze, etc.)
    See: Create Master QA script with Optional Config File #3

Subprocesses

Error Processing

Metadata Additions/Fixes

  • Image dimensions and area
  • Area difference from crop to original
  • Frobenius/L2 norm from crop to original
  • File count for originals and for each crop run
  • Percent area of crop to original Add Percent Area to Crop QA Metadata #7

Asserts

  • TBD acceptable %'s for metadata differences from crop to original images

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions