Skip to content

Monitoring #1041

@thorwolpert

Description

@thorwolpert

In order to be as proactive as possible, monitoring, alerting and reporting on key areas needs to be implemented. There is other infrastructure monitoring in place managed by the core SRE team, but the items below are the projects responsibility to manage.

DLQ (Dead Letter Queue) - Items that are logged here have failed to be applied and require human intervention:

  • strr-emailer-queue - emails that were unable to be processed by notify
  • strr-pay-queue - payments that have failed to be against applications

Interactions - failures logged here are when a notification has failed to be delivered to the receipient.

  • interaction_status is FAILED or (SENT and an excessive time has passed)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions