Uploaded image for project: 'Red Hat OpenShift AI Engineering'
  1. Red Hat OpenShift AI Engineering
  2. RHOAIENG-3020

document alert on 'distributed workload is queued for to long'

    XMLWordPrintable

Details

    • Task
    • Resolution: Unresolved
    • Undefined
    • None
    • None
    • Distributed Workloads
    • False
    • Hide

      None

      Show
      None
    • False
    • No
    • No
    • RHOAI DOCS 2.10
    • Testable

    Description

      As an RHOAI Operator, I want to create a PrometheusRule, so that I get alerted when the average time in queue of a Distributed Workload is about a threshold. 

      Attachments

        Activity

          People

            bmccolga@redhat.com Breda McColgan
            cgoern1@redhat.com Christoph Görn
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:

              PagerDuty