Skip to content

Information flow labels #261

@shreyashankar

Description

@shreyashankar

We want to track information flow throughout a pipeline at a user-defined label granularity. The user will specify labels on some inputs (i.e., the most upstream input), and we must store and propagate them throughout the pipeline.

Steps:

  • Create 2 tables: (1) mapping label key to IOPointer, (2) mapping deleted label to time of deletion request
    Add ability to index by IOPointer
    • Perform db migration
    • Modify existing commit functions to also commit a label or dummy label
  • Implement retrieval functions for queries
    • Get all IOPointers for a label
    • Get all labels for an IOPointer
    • Forward trace (get all artifacts derived from a label)
  • Implement delete_label function
  • Modify component run function to:
    • compute labels for outputs based on earlier retrieval functions
    • check if label is fully deleted and block computation if necessary
  • Write dummy pipelines to test functionality

Metadata

Metadata

Assignees

No one assigned

    Labels

    LLarge task, maybe somewhat dreading (multiple day & refactor)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions