Uploaded image for project: 'Red Hat OpenShift AI Engineering'
  1. Red Hat OpenShift AI Engineering
  2. RHOAIENG-3913

RHOAI Operator Reports Degraded False with ComponentErrors

XMLWordPrintable

    • False
    • Hide

      None

      Show
      None
    • False
    • Release Notes
    • No
    • Hide
      *Red Had OpenShift AI Operator incorrectly shows Degraded condition of False with an error*
      If you have enabled the KServe component in the DataScienceCluster (DSC) object used by the OpenShift AI Operator, but have not installed the dependent Red Hat OpenShift Service Mesh and Red Hat OpenShift Serverless Operators, the kserveReady condition in the DSC object correctly shows that KServe is not ready. However, the Degraded condition incorrectly shows a value of False.
      Workaround:
      Install the Red Hat OpenShift Serverless and Red Hat OpenShift Service Mesh Operators, and then recreate the DSC.
      Show
      *Red Had OpenShift AI Operator incorrectly shows Degraded condition of False with an error* If you have enabled the KServe component in the DataScienceCluster (DSC) object used by the OpenShift AI Operator, but have not installed the dependent Red Hat OpenShift Service Mesh and Red Hat OpenShift Serverless Operators, the kserveReady condition in the DSC object correctly shows that KServe is not ready. However, the Degraded condition incorrectly shows a value of False. Workaround: Install the Red Hat OpenShift Serverless and Red Hat OpenShift Service Mesh Operators, and then recreate the DSC.
    • Known Issue
    • Done
    • No
    • Testable

      When installing the DSC, if kserve is enabled and servicemesh and serverless are not installed on the cluster, the DSC correctly reports that KServe is not ready:

      - lastHeartbeatTime: '2024-02-29T00:03:14Z' 
        lastTransitionTime: '2024-02-29T00:03:14Z' 
        message: "Component reconciliation failed: 2 errors occurred:\n\t* operator servicemeshoperator not found. Please install the operator before enabling kserve component\n\t* operator serverless-operator not found. Please install the operator before enabling kserve component\n\n" 
        reason: ReconcileFailed 
        status: 'False' 
        type: kserveReady

      However, the degraded status still shows False even thought it is reporting an error:

       

          - lastHeartbeatTime: '2024-02-29T00:03:21Z'
            lastTransitionTime: '2024-02-28T23:58:11Z'
            message: "DataScienceCluster resource reconciled with component errors: 2 errors occurred:\n\t* operator servicemeshoperator not found. Please install the operator before enabling kserve component\n\t* operator serverless-operator not found. Please install the operator before enabling kserve component\n\n"
            reason: ReconcileCompletedWithComponentErrors
            status: 'False'
            type: Degraded 

       

      When an error is present, the operator should report `status: 'True'`

       

      To Reproduce:

      Create the default DSC without the serverless and servicemesh operators installed.

       

      Tested on 2.7 with OpenShift 4.15

            vhire Vaishnavi Hire
            troyer@redhat.com Trevor Royer
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated: