Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-970

[Alibabacloud] Failed to wait for bootstrapping to complete

XMLWordPrintable

    • False
    • Hide

      None

      Show
      None

      Description of problem:

      IPI install failed, in .openshift_install.log “Bootstrap failed to complete: timed out waiting for the condition”.

       

      Version-Release number of selected component (if applicable):

      4.12.0-0.nightly-2022-09-06-081331

      How reproducible:

      Always

      Steps to Reproduce:

      1. IPI install on Alibabacloud cluster
      
      

      Actual results:

      $ oc --kubeconfig kubeconfig get nodes
      NAME                       STATUS     ROLES                  AGE   VERSION
      gpei-test-lxjzq-master-0   NotReady   control-plane,master   66m   v1.24.0+ebabf6d
      gpei-test-lxjzq-master-1   NotReady   control-plane,master   65m   v1.24.0+ebabf6d
      gpei-test-lxjzq-master-2   NotReady   control-plane,master   65m   v1.24.0+ebabf6d
      $ oc --kubeconfig kubeconfig get pods -A | grep Running
      openshift-cloud-controller-manager-operator        cluster-cloud-controller-manager-operator-7b5bd6c445-g9xw9   2/2     Running   0          67m
      openshift-kube-apiserver                           apiserver-watcher-gpei-test-lxjzq-master-0                   1/1     Running   0          66m
      openshift-kube-apiserver                           apiserver-watcher-gpei-test-lxjzq-master-1                   1/1     Running   0          64m
      openshift-kube-apiserver                           apiserver-watcher-gpei-test-lxjzq-master-2                   1/1     Running   0          64m
      $ oc --kubeconfig kubeconfig logs -n openshift-cloud-controller-manager-operator cluster-cloud-controller-manager-operator-7b5bd6c445-g9xw9
      ......
      E0907 03:04:16.624029       1 clusteroperator_controller.go:123] Unable to sync cluster operator status: Put "https://api-int.gpei-test.
      alicloud-qe.devcluster.openshift.com:6443/apis/config.openshift.io/v1/clusteroperators/cloud-controller-manager/status": dial tcp 10.0.7
      2.141:6443: connect: connection refused
      E0907 03:04:16.624089       1 controller.go:326] CCMOperator "msg"="Reconciler error" "error"="Put \"https://api-int.gpei-test.alicloud-
      qe.devcluster.openshift.com:6443/apis/config.openshift.io/v1/clusteroperators/cloud-controller-manager/status\": dial tcp 10.0.72.141:64
      43: connect: connection refused" "clusterOperator"={"name":"cloud-controller-manager"} "controller"="clusteroperator" "controllerGroup"=
      "config.openshift.io" "controllerKind"="ClusterOperator" "name"="cloud-controller-manager" "namespace"="" "reconcileID"="2d68a996-0d8c-4
      327-8075-b47b04fa9bf2"
      E0907 03:04:17.906174       1 leaderelection.go:330] error retrieving resource lock openshift-cloud-controller-manager-operator/cluster-
      cloud-controller-manager-leader: Get "https://api-int.gpei-test.alicloud-qe.devcluster.openshift.com:6443/apis/coordination.k8s.io/v1/na
      mespaces/openshift-cloud-controller-manager-operator/leases/cluster-cloud-controller-manager-leader": dial tcp 10.0.72.141:6443: connect
      : connection refused
      $ 

      Expected results:

      install succeed

      Additional info:

      4.12.0-0.nightly-2022-08-30-142847 is OK.
      Flexy-install job: https://mastern-jenkins-csb-openshift-qe.apps.ocp-c1.prod.psi.redhat.com/job/ocp-common/job/Flexy-install/134611/

            Unassigned Unassigned
            rhn-support-jiwei Jianli Wei
            Jianli Wei Jianli Wei
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: