Job:
#OCPBUGS-32375 (issue, 10 days ago): Unsuccessful cluster installation with 4.15 nightlies on s390x using ABI [CLOSED]
Issue 15945005: Unsuccessful cluster installation with 4.15 nightlies on s390x using ABI
Description: When using the latest s390x release builds from the 4.15 nightly stream for an Agent-Based Installation (ABI) of SNO on IBM Z KVM, the installation fails at the end while waiting for cluster operators, even though the DNS and HAProxy configurations are correct: the same setup works with the 4.15.x stable release image builds.
 
 Below is the error encountered multiple times when the "release:s390x-latest" image is used while booting the cluster. This image is supplied at boot time through OPENSHIFT_INSTALL_RELEASE_IMAGE_OVERRIDE, while the openshift-install binary is fetched from the latest stable builds at [https://mirror.openshift.com/pub/openshift-v4/s390x/clients/ocp/latest/], which corresponds to roughly 4.15.x.
 
 *release-image:*
 {code:java}
 registry.build01.ci.openshift.org/ci-op-cdkdqnqn/release@sha256:c6eb4affa5c44d2ad220d7064e92270a30df5f26d221e35664f4d5547a835617
 {code}
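 For reference, the reproduction flow described above can be sketched as follows. This is a hedged outline, assuming the asset directory shown in the log below (/root/agent-sno) and the release pullspec listed above; it is not a verbatim transcript of the CI step.
 {code:none}
 # Point the installer at the nightly s390x release payload via the override described above:
 export OPENSHIFT_INSTALL_RELEASE_IMAGE_OVERRIDE=registry.build01.ci.openshift.org/ci-op-cdkdqnqn/release@sha256:c6eb4affa5c44d2ad220d7064e92270a30df5f26d221e35664f4d5547a835617

 # openshift-install binary fetched from the stable mirror (~4.15.x):
 #   https://mirror.openshift.com/pub/openshift-v4/s390x/clients/ocp/latest/
 /root/agent-sno/openshift-install agent create image --dir /root/agent-sno/ --log-level debug

 # Boot the generated agent ISO on the IBM Z KVM guest, then watch the installation:
 /root/agent-sno/openshift-install wait-for install-complete --dir /root/agent-sno/ --log-level debug
 {code}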
 
 *PROW CI Build :* [https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_release/47965/rehearse-47965-periodic-ci-openshift-multiarch-master-nightly-4.15-e2e-agent-ibmz-sno/1780162365824700416] 
 
 *Error:* 
 {code:java}
 '/root/agent-sno/openshift-install wait-for install-complete --dir /root/agent-sno/ --log-level debug'
 Warning: Permanently added '128.168.142.71' (ED25519) to the list of known hosts.
 level=debug msg=OpenShift Installer 4.15.8
 level=debug msg=Built from commit f4f5d0ee0f7591fd9ddf03ac337c804608102919
 level=debug msg=Loading Install Config...
 level=debug msg=  Loading SSH Key...
 level=debug msg=  Loading Base Domain...
 level=debug msg=    Loading Platform...
 level=debug msg=  Loading Cluster Name...
 level=debug msg=    Loading Base Domain...
 level=debug msg=    Loading Platform...
 level=debug msg=  Loading Pull Secret...
 level=debug msg=  Loading Platform...
 level=debug msg=Loading Agent Config...
 level=debug msg=Using Agent Config loaded from state file
 level=warning msg=An agent configuration was detected but this command is not the agent wait-for command
 level=info msg=Waiting up to 40m0s (until 10:15AM UTC) for the cluster at https://api.agent-sno.abi-ci.com:6443 to initialize...
 W0416 09:35:51.793770    1589 reflector.go:535] k8s.io/client-go/tools/watch/informerwatcher.go:146: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 E0416 09:35:51.793827    1589 reflector.go:147] k8s.io/client-go/tools/watch/informerwatcher.go:146: Failed to watch *v1.ClusterVersion: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 W0416 09:35:53.127917    1589 reflector.go:535] k8s.io/client-go/tools/watch/informerwatcher.go:146: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 E0416 09:35:53.127946    1589 reflector.go:147] k8s.io/client-go/tools/watch/informerwatcher.go:146: Failed to watch *v1.ClusterVersion: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 ... 112 lines not shown
 W0416 10:15:17.227351    1589 reflector.go:535] k8s.io/client-go/tools/watch/informerwatcher.go:146: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 E0416 10:15:17.227424    1589 reflector.go:147] k8s.io/client-go/tools/watch/informerwatcher.go:146: Failed to watch *v1.ClusterVersion: failed to list *v1.ClusterVersion: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusterversions?fieldSelector=metadata.name%3Dversion&limit=500&resourceVersion=0": dial tcp 10.244.64.4:6443: connect: connection refused
 level=error msg=Attempted to gather ClusterOperator status after wait failure: listing ClusterOperator objects: Get "https://api.agent-sno.abi-ci.com:6443/apis/config.openshift.io/v1/clusteroperators": dial tcp 10.244.64.4:6443: connect: connection refused
 level=error msg=Cluster initialization failed because one or more operators are not functioning properly.
 level=error msg=The cluster should be accessible for troubleshooting as detailed in the documentation linked below,
 level=error msg=https://docs.openshift.com/container-platform/latest/support/troubleshooting/troubleshooting-installations.html
 level=error msg=The 'wait-for install-complete' subcommand can then be used to continue the installation
 level=error msg=failed to initialize the cluster: timed out waiting for the condition
 {"component":"entrypoint","error":"wrapped process failed: exit status 6","file":"k8s.io/test-infra/prow/entrypoint/run.go:84","func":"k8s.io/test-infra/prow/entrypoint.Options.internalRun","level":"error","msg":"Error executing test process","severity":"error","time":"2024-04-16T10:15:51Z"}
 error: failed to execute wrapped command: exit status 6 {code}
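 As the installer output above suggests, the wait can be resumed once the API endpoint responds again. A minimal sketch, assuming the admin kubeconfig generated under the same asset directory:
 {code:none}
 # Resume watching for install completion:
 /root/agent-sno/openshift-install wait-for install-complete --dir /root/agent-sno/ --log-level debug

 # Inspect ClusterVersion and ClusterOperator status directly (assumed kubeconfig path):
 export KUBECONFIG=/root/agent-sno/auth/kubeconfig
 oc get clusterversion
 oc get clusteroperators
 {code}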
Status: CLOSED
#OCPBUGS-32517 (issue, 38 hours ago): Missing worker nodes on metal [Verified]
Mon 2024-04-22 05:33:53 UTC localhost.localdomain master-bmh-update.service[12603]: Unpause all baremetal hosts
Mon 2024-04-22 05:33:53 UTC localhost.localdomain master-bmh-update.service[18264]: E0422 05:33:53.630867   18264 memcache.go:265] couldn't get current server API group list: Get "https://localhost:6443/api?timeout=32s": dial tcp [::1]:6443: connect: connection refused
Mon 2024-04-22 05:33:53 UTC localhost.localdomain master-bmh-update.service[18264]: E0422 05:33:53.631351   18264 memcache.go:265] couldn't get current server API group list: Get "https://localhost:6443/api?timeout=32s": dial tcp [::1]:6443: connect: connection refused

... 4 lines not shown

#OCPBUGS-31763 (issue, 10 days ago): gcp install cluster creation fails after 30-40 minutes [New]
Issue 15921939: gcp install cluster creation fails after 30-40 minutes
Description: Component Readiness has found a potential regression in "install should succeed: overall". I see this on various platforms, but I started digging into the GCP failures. No installer log bundle is created, which seriously hinders my ability to dig further.
 
 Bootstrap succeeds, and then, about 30 minutes into waiting for cluster creation, it dies.
 
 From [https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-nightly-4.16-e2e-gcp-sdn-serial/1775871000018161664]
 
 search.ci tells me this affects nearly 10% of jobs on GCP:
 
 [https://search.dptools.openshift.org/?search=Attempted+to+gather+ClusterOperator+status+after+installation+failure%3A+listing+ClusterOperator+objects.*connection+refused&maxAge=168h&context=1&type=bug%2Bissue%2Bjunit&name=.*4.16.*gcp.*&excludeName=&maxMatches=5&maxBytes=20971520&groupBy=job]
 
  
 {code:java}
 time="2024-04-04T13:27:50Z" level=info msg="Waiting up to 40m0s (until 2:07PM UTC) for the cluster at https://api.ci-op-n3pv5pn3-4e5f3.XXXXXXXXXXXXXXXXXXXXXX:6443 to initialize..."
 time="2024-04-04T14:07:50Z" level=error msg="Attempted to gather ClusterOperator status after installation failure: listing ClusterOperator objects: Get \"https://api.ci-op-n3pv5pn3-4e5f3.XXXXXXXXXXXXXXXXXXXXXX:6443/apis/config.openshift.io/v1/clusteroperators\": dial tcp 35.238.130.20:6443: connect: connection refused"
 time="2024-04-04T14:07:50Z" level=error msg="Cluster initialization failed because one or more operators are not functioning properly.\nThe cluster should be accessible for troubleshooting as detailed in the documentation linked below,\nhttps://docs.openshift.com/container-platform/latest/support/troubleshooting/troubleshooting-installations.html\nThe 'wait-for install-complete' subcommand can then be used to continue the installation"
 time="2024-04-04T14:07:50Z" level=error msg="failed to initialize the cluster: timed out waiting for the condition" {code}
  
 
 Probability of significant regression: 99.44%
 
 Sample (being evaluated) Release: 4.16
 Start Time: 2024-03-29T00:00:00Z
 End Time: 2024-04-04T23:59:59Z
 Success Rate: 68.75%
 Successes: 11
 Failures: 5
 Flakes: 0
 
 Base (historical) Release: 4.15
 Start Time: 2024-02-01T00:00:00Z
 End Time: 2024-02-28T23:59:59Z
 Success Rate: 96.30%
 Successes: 52
 Failures: 2
 Flakes: 0
 
 View the test details report at [https://sippy.dptools.openshift.org/sippy-ng/component_readiness/test_details?arch=amd64&arch=amd64&baseEndTime=2024-02-28%2023%3A59%3A59&baseRelease=4.15&baseStartTime=2024-02-01%2000%3A00%3A00&capability=Other&component=Installer%20%2F%20openshift-installer&confidence=95&environment=sdn%20upgrade-micro%20amd64%20gcp%20standard&excludeArches=arm64%2Cheterogeneous%2Cppc64le%2Cs390x&excludeClouds=openstack%2Cibmcloud%2Clibvirt%2Covirt%2Cunknown&excludeVariants=hypershift%2Cosd%2Cmicroshift%2Ctechpreview%2Csingle-node%2Cassisted%2Ccompact&groupBy=cloud%2Carch%2Cnetwork&ignoreDisruption=true&ignoreMissing=false&minFail=3&network=sdn&network=sdn&pity=5&platform=gcp&platform=gcp&sampleEndTime=2024-04-04%2023%3A59%3A59&sampleRelease=4.16&sampleStartTime=2024-03-29%2000%3A00%3A00&testId=cluster%20install%3A0cb1bb27e418491b1ffdacab58c5c8c0&testName=install%20should%20succeed%3A%20overall&upgrade=upgrade-micro&upgrade=upgrade-micro&variant=standard&variant=standard]
Status: New
#OCPBUGS-27755 (issue, 9 days ago): openshift-kube-apiserver down and is not being restarted [New]
Issue 15736514: openshift-kube-apiserver down and is not being restarted
Description: Description of problem:
 {code:none}
 SNO cluster; this is the second time that the issue has happened.
 
 Errors like the following are reported:
 
 ~~~
 failed to fetch token: Post "https://api-int.<cluster>:6443/api/v1/namespaces/openshift-cluster-storage-operator/serviceaccounts/cluster-storage-operator/token": dial tcp <ip>:6443: connect: connection refused
 ~~~
 
 Checking the pod logs, the kube-apiserver pod was terminated and is not being restarted:
 
 ~~~
 2024-01-13T09:41:40.931716166Z I0113 09:41:40.931584       1 main.go:213] Received signal terminated. Forwarding to sub-process "hyperkube".
 ~~~{code}
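 Not part of the original report, but a minimal sketch of how the missing restart could be confirmed on the node (assuming debug or SSH access to the SNO host):
 {code:none}
 # From a debug shell on the node (oc debug node/<node>, then chroot /host):
 crictl ps -a --name kube-apiserver          # look for an exited container with no newer replacement
 journalctl -u kubelet --since "1 hour ago" | grep -i kube-apiserver
 ls -l /etc/kubernetes/manifests/            # the static pod manifests should still be present
 {code}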
 Version-Release number of selected component (if applicable):
 {code:none}
    4.13.13 {code}
 How reproducible:
 {code:none}
     Not reproducible but has happened twice{code}
 Steps to Reproduce:
 {code:none}
     1.
     2.
     3.
     {code}
 Actual results:
 {code:none}
     API is not available and kube-apiserver is not being restarted{code}
 Expected results:
 {code:none}
     We would expect to see kube-apiserver restarts{code}
 Additional info:
 {code:none}
    {code}
Status: New
#OCPBUGS-33157 (issue, 38 hours ago): IPv6 metal-ipi jobs: master-bmh-update losing access to API [Verified]
Issue 15978085: IPv6 metal-ipi jobs: master-bmh-update losing access to API
Description: The last 4 IPv6 jobs are failing on the same error.
 
 https://prow.ci.openshift.org/job-history/gs/test-platform-results/logs/periodic-ci-openshift-release-master-nightly-4.16-e2e-metal-ipi-ovn-ipv6
 master-bmh-update.log loses access to the API when trying to get/update the BMH details
 
 https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-nightly-4.16-e2e-metal-ipi-ovn-ipv6/1785492737169035264
 
 
 
 {noformat}
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[4663]: Waiting for 3 masters to become provisioned
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: E0501 03:32:23.531242   24484 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: E0501 03:32:23.531808   24484 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: E0501 03:32:23.533281   24484 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: E0501 03:32:23.533630   24484 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: E0501 03:32:23.535180   24484 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
 May 01 03:32:23 localhost.localdomain master-bmh-update.sh[24484]: The connection to the server api-int.ostest.test.metalkube.org:6443 was refused - did you specify the right host or port?
 {noformat}
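 For context, a hedged sketch of manually reproducing the check the script performs; the kubeconfig path is an assumption, and the /version probe mirrors the endpoint used in similar bootkube logs:
 {noformat}
 # Assumed manual equivalent of the script's BareMetalHost polling:
 oc --kubeconfig=<bootstrap-kubeconfig> get baremetalhosts -n openshift-machine-api

 # Check whether the internal API endpoint answers at all while the failure reproduces:
 curl -k https://api-int.ostest.test.metalkube.org:6443/version
 {noformat}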
Status: Verified
{noformat}
May 01 02:49:40 localhost.localdomain master-bmh-update.sh[12448]: E0501 02:49:40.429468   12448 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ostest.test.metalkube.org:6443/api?timeout=32s": dial tcp [fd2e:6f44:5dd8:c956::5]:6443: connect: connection refused
{noformat}
#OCPBUGS-17183 (issue, 2 days ago): [BUG] Assisted installer fails to create bond with active backup for single node installation [New]
Issue 15401516: [BUG] Assisted installer fails to create bond with active backup for single node installation
Description: Description of problem:
 {code:none}
 The Assisted Installer always fails to create a bond with active-backup mode using nmstate YAML, and the errors are:
 
 ~~~ 
 Jul 26 07:11:47 <hostname> bootkube.sh[8366]: Unable to reach API_URL's https endpoint at https://xx.xx.32.40:6443/version
 Jul 26 07:11:47 <hostname> bootkube.sh[8366]: Checking validity of <hostname> of type API_INT_URL 
 Jul 26 07:11:47 <hostname> bootkube.sh[8366]: Successfully resolved API_INT_URL <hostname> 
 Jul 26 07:11:47 <hostname> bootkube.sh[8366]: Unable to reach API_INT_URL's https endpoint at https://xx.xx.32.40:6443/version
 Jul 26 07:12:23 <hostname> bootkube.sh[12960]: Still waiting for the Kubernetes API: Get "https://localhost:6443/readyz": dial tcp [::1]:6443: connect: connection refused
 Jul 26 07:15:15 <hostname> bootkube.sh[15706]: The connection to the server <hostname>:6443 was refused - did you specify the right host or port?
 Jul 26 07:15:15 <hostname> bootkube.sh[15706]: The connection to the server <hostname>:6443 was refused - did you specify the right host or port? 
  ~~~ 
 
 Where <hostname> is the actual hostname of the node.
 
 Adding the sosreport and nmstate YAML file here: https://drive.google.com/drive/u/0/folders/19dNzKUPIMmnUls2pT_stuJxr2Dxdi5eb{code}
 Version-Release number of selected component (if applicable):
 {code:none}
 4.12 
 Dell 16g Poweredge R660{code}
 How reproducible:
 {code:none}
 Always at customer side{code}
 Steps to Reproduce:
 {code:none}
 1. Open Assisted installer UI (console.redhat.com -> assisted installer) 
 2. Add the network configs as below for host1  
 
 -----------
 interfaces:
 - name: bond99
   type: bond
   state: up
   ipv4:
     address:
     - ip: xx.xx.32.40
       prefix-length: 24
     enabled: true
   link-aggregation:
     mode: active-backup
     options:
       miimon: '140'
     port:
     - eno12399
     - eno12409
 dns-resolver:
   config:
     search:
     - xxxx
     server:
     - xx.xx.xx.xx
 routes:
   config:
     - destination: 0.0.0.0/0
       metric: 150
       next-hop-address: xx.xx.xx.xx
       next-hop-interface: bond99
       table-id: 254    
 -----------
 
 3. Enter the MAC addresses of the interfaces in the fields. 
 4. Generate the ISO and boot the node. The node cannot be reached via ping/ssh. This happens every time and is reproducible. 
 5. As there was no way to check what was happening on the node (ssh was not working), we reset the root password and saw that the IP address was present on the bond, yet ping/ssh still did not work (a minimal bond-state check is sketched after this block). 
 6. After multiple reboots, the customer was able to ssh/ping and provided a sosreport, where the above-mentioned errors appear in the journal logs.  
  {code}
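 For step 5 above, when ssh is unavailable it can help to confirm from a local console whether the kernel actually created the bond as configured. Below is a minimal Python sketch, assuming console access to the booted node; the bond and port names are taken from the nmstate YAML above, and /proc/net/bonding/<bond> is the standard bonding-driver status file:
 {code:none}
 #!/usr/bin/env python3
 # Minimal sketch: confirm on the node whether bond99 exists, is in
 # active-backup mode, and has both expected ports enslaved, independent
 # of whether ping/ssh to the node works.
 from pathlib import Path

 BOND = "bond99"                            # bond name from the nmstate YAML above
 EXPECTED_PORTS = {"eno12399", "eno12409"}  # ports from the nmstate YAML above

 def main() -> None:
     status = Path(f"/proc/net/bonding/{BOND}")
     if not status.exists():
         print(f"{BOND}: not present under /proc/net/bonding (bond was not created)")
         return
     text = status.read_text()
     slaves = {
         line.split(":", 1)[1].strip()
         for line in text.splitlines()
         if line.startswith("Slave Interface:")
     }
     print(f"active-backup mode: {'active-backup' in text}")
     print(f"enslaved ports:     {sorted(slaves)}")
     print(f"missing ports:      {sorted(EXPECTED_PORTS - slaves)}")

 if __name__ == "__main__":
     main()
 {code}
 If the bond and both ports show up here while the node is still unreachable, the problem is more likely in the active-backup link/failover behaviour than in the nmstate rendering itself.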
 Actual results:
 {code:none}
 Installation fails; there appears to be a networking issue.{code}
 Expected results:
 {code:none}
 Installation proceeds without the above-mentioned issues{code}
 Additional info:
 {code:none}
 - The installation works with round-robin bond mode in 4.12. 
 - The installation also works with active-backup in 4.10. 
 - An active-backup bond with 4.12 fails.{code}
Status: New
#OCPBUGS-30631issue2 weeks agoSNO (RT kernel) sosreport crash the SNO node CLOSED
Issue 15865131: SNO (RT kernel) sosreport crash the SNO node
Description: Description of problem:
 {code:none}
 sosreport collection causes SNO XR11 node crash.
 {code}
 Version-Release number of selected component (if applicable):
 {code:none}
 - RHOCP    : 4.12.30
 - kernel   : 4.18.0-372.69.1.rt7.227.el8_6.x86_64
 - platform : x86_64{code}
 How reproducible:
 {code:none}
 sh-4.4# chrt -rr 99 toolbox
 .toolboxrc file detected, overriding defaults...
 Checking if there is a newer version of ocpdalmirror.xxx.yyy:8443/rhel8/support-tools-zzz-feb available...
 Container 'toolbox-root' already exists. Trying to start...
 (To remove the container and start with a fresh toolbox, run: sudo podman rm 'toolbox-root')
 toolbox-root
 Container started successfully. To exit, type 'exit'.
 [root@node /]# which sos
 /usr/sbin/sos
 logger: socket /dev/log: No such file or directory
 [root@node /]# taskset -c 29-31,61-63 sos report --batch -n networking,kernel,processor -k crio.all=on -k crio.logs=on -k podman.all=on -kpodman.logs=on
 
 sosreport (version 4.5.6)
 
 This command will collect diagnostic and configuration information from
 this Red Hat CoreOS system.
 
 An archive containing the collected information will be generated in
 /host/var/tmp/sos.c09e4f7z and may be provided to a Red Hat support
 representative.
 
 Any information provided to Red Hat will be treated in accordance with
 the published support policies at:
 
         Distribution Website : https://www.redhat.com/
         Commercial Support   : https://access.redhat.com/
 
 The generated archive may contain data considered sensitive and its
 content should be reviewed by the originating organization before being
 passed to any third party.
 
 No changes will be made to system configuration.
 
 
  Setting up archive ...
  Setting up plugins ...
 [plugin:auditd] Could not open conf file /etc/audit/auditd.conf: [Errno 2] No such file or directory: '/etc/audit/auditd.conf'
 caught exception in plugin method "system.setup()"
 writing traceback to sos_logs/system-plugin-errors.txt
 [plugin:systemd] skipped command 'resolvectl status': required services missing: systemd-resolved.
 [plugin:systemd] skipped command 'resolvectl statistics': required services missing: systemd-resolved.
  Running plugins. Please wait ...
 
   Starting 1/91  alternatives    [Running: alternatives]
   Starting 2/91  atomichost      [Running: alternatives atomichost]
   Starting 3/91  auditd          [Running: alternatives atomichost auditd]
   Starting 4/91  block           [Running: alternatives atomichost auditd block]
   Starting 5/91  boot            [Running: alternatives auditd block boot]
   Starting 6/91  cgroups         [Running: auditd block boot cgroups]
   Starting 7/91  chrony          [Running: auditd block cgroups chrony]
   Starting 8/91  cifs            [Running: auditd block cgroups cifs]
   Starting 9/91  conntrack       [Running: auditd block cgroups conntrack]
   Starting 10/91 console         [Running: block cgroups conntrack console]
   Starting 11/91 container_log   [Running: block cgroups conntrack container_log]
   Starting 12/91 containers_common [Running: block cgroups conntrack containers_common]
   Starting 13/91 crio            [Running: block cgroups conntrack crio]
   Starting 14/91 crypto          [Running: cgroups conntrack crio crypto]
   Starting 15/91 date            [Running: cgroups conntrack crio date]
   Starting 16/91 dbus            [Running: cgroups conntrack crio dbus]
   Starting 17/91 devicemapper    [Running: cgroups conntrack crio devicemapper]
   Starting 18/91 devices         [Running: cgroups conntrack crio devices]
   Starting 19/91 dracut          [Running: cgroups conntrack crio dracut]
   Starting 20/91 ebpf            [Running: cgroups conntrack crio ebpf]
   Starting 21/91 etcd            [Running: cgroups crio ebpf etcd]
   Starting 22/91 filesys         [Running: cgroups crio ebpf filesys]
   Starting 23/91 firewall_tables [Running: cgroups crio filesys firewall_tables]
   Starting 24/91 fwupd           [Running: cgroups crio filesys fwupd]
   Starting 25/91 gluster         [Running: cgroups crio filesys gluster]
   Starting 26/91 grub2           [Running: cgroups crio filesys grub2]
   Starting 27/91 gssproxy        [Running: cgroups crio grub2 gssproxy]
   Starting 28/91 hardware        [Running: cgroups crio grub2 hardware]
   Starting 29/91 host            [Running: cgroups crio hardware host]
   Starting 30/91 hts             [Running: cgroups crio hardware hts]
   Starting 31/91 i18n            [Running: cgroups crio hardware i18n]
   Starting 32/91 iscsi           [Running: cgroups crio hardware iscsi]
   Starting 33/91 jars            [Running: cgroups crio hardware jars]
   Starting 34/91 kdump           [Running: cgroups crio hardware kdump]
   Starting 35/91 kernelrt        [Running: cgroups crio hardware kernelrt]
   Starting 36/91 keyutils        [Running: cgroups crio hardware keyutils]
   Starting 37/91 krb5            [Running: cgroups crio hardware krb5]
   Starting 38/91 kvm             [Running: cgroups crio hardware kvm]
   Starting 39/91 ldap            [Running: cgroups crio kvm ldap]
   Starting 40/91 libraries       [Running: cgroups crio kvm libraries]
   Starting 41/91 libvirt         [Running: cgroups crio kvm libvirt]
   Starting 42/91 login           [Running: cgroups crio kvm login]
   Starting 43/91 logrotate       [Running: cgroups crio kvm logrotate]
   Starting 44/91 logs            [Running: cgroups crio kvm logs]
   Starting 45/91 lvm2            [Running: cgroups crio logs lvm2]
   Starting 46/91 md              [Running: cgroups crio logs md]
   Starting 47/91 memory          [Running: cgroups crio logs memory]
   Starting 48/91 microshift_ovn  [Running: cgroups crio logs microshift_ovn]
   Starting 49/91 multipath       [Running: cgroups crio logs multipath]
   Starting 50/91 networkmanager  [Running: cgroups crio logs networkmanager]
 
 Removing debug pod ...
 error: unable to delete the debug pod "ransno1ransnomavdallabcom-debug": Delete "https://api.ransno.mavdallab.com:6443/api/v1/namespaces/openshift-debug-mt82m/pods/ransno1ransnomavdallabcom-debug": dial tcp 10.71.136.144:6443: connect: connection refused
 {code}
 Steps to Reproduce:
 {code:none}
 Launch a debug pod, run the procedure above, and it crashes the node{code}
 Actual results:
 {code:none}
 Node crash{code}
 Expected results:
 {code:none}
 Node does not crash{code}
 Additional info:
 {code:none}
 We have two vmcores on the associated SFDC ticket.
 This system uses an RT kernel.
 It is using an out-of-tree ice driver 1.13.7 (probably from 22 Dec 2023).
 
 [  103.681608] ice: module unloaded
 [  103.830535] ice: loading out-of-tree module taints kernel.
 [  103.831106] ice: module verification failed: signature and/or required key missing - tainting kernel
 [  103.841005] ice: Intel(R) Ethernet Connection E800 Series Linux Driver - version 1.13.7
 [  103.841017] ice: Copyright (C) 2018-2023 Intel Corporation
 
 
 With the following kernel command line 
 
 Command line: BOOT_IMAGE=(hd0,gpt3)/ostree/rhcos-f2c287e549b45a742b62e4f748bc2faae6ca907d24bb1e029e4985bc01649033/vmlinuz-4.18.0-372.69.1.rt7.227.el8_6.x86_64 ignition.platform.id=metal ostree=/ostree/boot.1/rhcos/f2c287e549b45a742b62e4f748bc2faae6ca907d24bb1e029e4985bc01649033/0 root=UUID=3e8bda80-5cf4-4c46-b139-4c84cb006354 rw rootflags=prjquota boot=UUID=1d0512c2-3f92-42c5-b26d-709ff9350b81 intel_iommu=on iommu=pt firmware_class.path=/var/lib/firmware skew_tick=1 nohz=on rcu_nocbs=3-31,35-63 tuned.non_isolcpus=00000007,00000007 systemd.cpu_affinity=0,1,2,32,33,34 intel_iommu=on iommu=pt isolcpus=managed_irq,3-31,35-63 nohz_full=3-31,35-63 tsc=nowatchdog nosoftlockup nmi_watchdog=0 mce=off rcutree.kthread_prio=11 default_hugepagesz=1G rcupdate.rcu_normal_after_boot=0 efi=runtime module_blacklist=irdma intel_pstate=passive intel_idle.max_cstate=0 crashkernel=256M
 
 
 
 vmcore1 shows an issue with the ice driver 
 
 crash vmcore tmp/vmlinux
 
 
       KERNEL: tmp/vmlinux  [TAINTED]
     DUMPFILE: vmcore  [PARTIAL DUMP]
         CPUS: 64
         DATE: Thu Mar  7 17:16:57 CET 2024
       UPTIME: 02:44:28
 LOAD AVERAGE: 24.97, 25.47, 25.46
        TASKS: 5324
     NODENAME: aaa.bbb.ccc
      RELEASE: 4.18.0-372.69.1.rt7.227.el8_6.x86_64
      VERSION: #1 SMP PREEMPT_RT Fri Aug 4 00:21:46 EDT 2023
      MACHINE: x86_64  (1500 Mhz)
       MEMORY: 127.3 GB
        PANIC: "Kernel panic - not syncing:"
          PID: 693
      COMMAND: "khungtaskd"
         TASK: ff4d1890260d4000  [THREAD_INFO: ff4d1890260d4000]
          CPU: 0
        STATE: TASK_RUNNING (PANIC)
 
 crash> ps|grep sos                                                                                                                                                                                                                                                                                                           
   449071  363440  31  ff4d189005f68000  IN   0.2  506428 314484  sos                                                                                                                                                                                                                                                         
   451043  363440  63  ff4d188943a9c000  IN   0.2  506428 314484  sos                                                                                                                                                                                                                                                         
   494099  363440  29  ff4d187f941f4000  UN   0.2  506428 314484  sos     
 
 [ 8457.517696] ------------[ cut here ]------------
 [ 8457.517698] NETDEV WATCHDOG: ens3f1 (ice): transmit queue 35 timed out
 [ 8457.517711] WARNING: CPU: 33 PID: 349 at net/sched/sch_generic.c:472 dev_watchdog+0x270/0x300
 [ 8457.517718] Modules linked in: binfmt_misc macvlan pci_pf_stub iavf vfio_pci vfio_virqfd vfio_iommu_type1 vfio vhost_net vhost vhost_iotlb tap tun xt_addrtype nf_conntrack_netlink ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_nat xt_CT tcp_diag inet_diag ip6t_MASQUERADE xt_mark ice(OE) xt_conntrack ipt_MASQUERADE nft_counter xt_comment nft_compat veth nft_chain_nat nf_tables overlay bridge 8021q garp mrp stp llc nfnetlink_cttimeout nfnetlink openvswitch nf_conncount nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ext4 mbcache jbd2 intel_rapl_msr iTCO_wdt iTCO_vendor_support dell_smbios wmi_bmof dell_wmi_descriptor dcdbas kvm_intel kvm irqbypass intel_rapl_common i10nm_edac nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp coretemp rapl ipmi_ssif intel_cstate intel_uncore dm_thin_pool pcspkr isst_if_mbox_pci dm_persistent_data dm_bio_prison dm_bufio isst_if_mmio isst_if_common mei_me i2c_i801 joydev mei intel_pmt wmi acpi_ipmi ipmi_si acpi_power_meter sctp ip6_udp_tunnel
 [ 8457.517770]  udp_tunnel ip_tables xfs libcrc32c i40e sd_mod t10_pi sg bnxt_re ib_uverbs ib_core crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel bnxt_en ahci libahci libata dm_multipath dm_mirror dm_region_hash dm_log dm_mod ipmi_devintf ipmi_msghandler fuse [last unloaded: ice]
 [ 8457.517784] Red Hat flags: eBPF/rawtrace
 [ 8457.517787] CPU: 33 PID: 349 Comm: ktimers/33 Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-372.69.1.rt7.227.el8_6.x86_64 #1
 [ 8457.517789] Hardware name: Dell Inc. PowerEdge XR11/0P2RNT, BIOS 1.12.1 09/13/2023
 [ 8457.517790] RIP: 0010:dev_watchdog+0x270/0x300
 [ 8457.517793] Code: 17 00 e9 f0 fe ff ff 4c 89 e7 c6 05 c6 03 34 01 01 e8 14 43 fa ff 89 d9 4c 89 e6 48 c7 c7 90 37 98 9a 48 89 c2 e8 1d be 88 ff <0f> 0b eb ad 65 8b 05 05 13 fb 65 89 c0 48 0f a3 05 1b ab 36 01 73
 [ 8457.517795] RSP: 0018:ff7aeb55c73c7d78 EFLAGS: 00010286
 [ 8457.517797] RAX: 0000000000000000 RBX: 0000000000000023 RCX: 0000000000000001
 [ 8457.517798] RDX: 0000000000000000 RSI: ffffffff9a908557 RDI: 00000000ffffffff
 [ 8457.517799] RBP: 0000000000000021 R08: ffffffff9ae6b3a0 R09: 00080000000000ff
 [ 8457.517800] R10: 000000006443a462 R11: 0000000000000036 R12: ff4d187f4d1f4000
 [ 8457.517801] R13: ff4d187f4d20df00 R14: ff4d187f4d1f44a0 R15: 0000000000000080
 [ 8457.517803] FS:  0000000000000000(0000) GS:ff4d18967a040000(0000) knlGS:0000000000000000
 [ 8457.517804] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 [ 8457.517805] CR2: 00007fc47c649974 CR3: 00000019a441a005 CR4: 0000000000771ea0
 [ 8457.517806] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
 [ 8457.517807] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
 [ 8457.517808] PKRU: 55555554
 [ 8457.517810] Call Trace:
 [ 8457.517813]  ? test_ti_thread_flag.constprop.50+0x10/0x10
 [ 8457.517816]  ? test_ti_thread_flag.constprop.50+0x10/0x10
 [ 8457.517818]  call_timer_fn+0x32/0x1d0
 [ 8457.517822]  ? test_ti_thread_flag.constprop.50+0x10/0x10
 [ 8457.517825]  run_timer_softirq+0x1fc/0x640
 [ 8457.517828]  ? _raw_spin_unlock_irq+0x1d/0x60
 [ 8457.517833]  ? finish_task_switch+0xea/0x320
 [ 8457.517836]  ? __switch_to+0x10c/0x4d0
 [ 8457.517840]  __do_softirq+0xa5/0x33f
 [ 8457.517844]  run_timersd+0x61/0xb0
 [ 8457.517848]  smpboot_thread_fn+0x1c1/0x2b0
 [ 8457.517851]  ? smpboot_register_percpu_thread_cpumask+0x140/0x140
 [ 8457.517853]  kthread+0x151/0x170
 [ 8457.517856]  ? set_kthread_struct+0x50/0x50
 [ 8457.517858]  ret_from_fork+0x1f/0x40
 [ 8457.517861] ---[ end trace 0000000000000002 ]---
 [ 8458.520445] ice 0000:8a:00.1 ens3f1: tx_timeout: VSI_num: 14, Q 35, NTC: 0x99, HW_HEAD: 0x14, NTU: 0x15, INT: 0x0
 [ 8458.520451] ice 0000:8a:00.1 ens3f1: tx_timeout recovery level 1, txqueue 35
 [ 8506.139246] ice 0000:8a:00.1: PTP reset successful
 [ 8506.437047] ice 0000:8a:00.1: VSI rebuilt. VSI index 0, type ICE_VSI_PF
 [ 8506.445482] ice 0000:8a:00.1: VSI rebuilt. VSI index 1, type ICE_VSI_CTRL
 [ 8540.459707] ice 0000:8a:00.1 ens3f1: tx_timeout: VSI_num: 14, Q 35, NTC: 0xe3, HW_HEAD: 0xe7, NTU: 0xe8, INT: 0x0
 [ 8540.459714] ice 0000:8a:00.1 ens3f1: tx_timeout recovery level 1, txqueue 35
 [ 8563.891356] ice 0000:8a:00.1: PTP reset successful
 
 The second vmcore on the same node shows an issue with the SSD drive
 
 $ crash vmcore-2 tmp/vmlinux
 
       KERNEL: tmp/vmlinux  [TAINTED]
     DUMPFILE: vmcore-2  [PARTIAL DUMP]
         CPUS: 64
         DATE: Thu Mar  7 14:29:31 CET 2024
       UPTIME: 1 days, 07:19:52
 LOAD AVERAGE: 25.55, 26.42, 28.30
        TASKS: 5409
     NODENAME: aaa.bbb.ccc
      RELEASE: 4.18.0-372.69.1.rt7.227.el8_6.x86_64
      VERSION: #1 SMP PREEMPT_RT Fri Aug 4 00:21:46 EDT 2023
      MACHINE: x86_64  (1500 Mhz)
       MEMORY: 127.3 GB
        PANIC: "Kernel panic - not syncing:"
          PID: 696
      COMMAND: "khungtaskd"
         TASK: ff2b35ed48d30000  [THREAD_INFO: ff2b35ed48d30000]
          CPU: 34
        STATE: TASK_RUNNING (PANIC)
 
 crash> ps |grep sos
   719784  718369  62  ff2b35ff00830000  IN   0.4 1215636 563388  sos
   721740  718369  61  ff2b3605579f8000  IN   0.4 1215636 563388  sos
   721742  718369  63  ff2b35fa5eb9c000  IN   0.4 1215636 563388  sos
   721744  718369  30  ff2b3603367fc000  IN   0.4 1215636 563388  sos
   721746  718369  29  ff2b360557944000  IN   0.4 1215636 563388  sos
   743356  718369  62  ff2b36042c8e0000  IN   0.4 1215636 563388  sos
   743818  718369  29  ff2b35f6186d0000  IN   0.4 1215636 563388  sos
   748518  718369  61  ff2b3602cfb84000  IN   0.4 1215636 563388  sos
   748884  718369  62  ff2b360713418000  UN   0.4 1215636 563388  sos
 
 crash> dmesg
 
 [111871.309883] ata3.00: exception Emask 0x0 SAct 0x3ff8 SErr 0x0 action 0x6 frozen
 [111871.309889] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309891] ata3.00: cmd 61/40:18:28:47:4b/00:00:00:00:00/40 tag 3 ncq dma 32768 out
                          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309895] ata3.00: status: { DRDY }
 [111871.309897] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309904] ata3.00: cmd 61/40:20:68:47:4b/00:00:00:00:00/40 tag 4 ncq dma 32768 out
                          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309908] ata3.00: status: { DRDY }
 [111871.309909] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309910] ata3.00: cmd 61/40:28:a8:47:4b/00:00:00:00:00/40 tag 5 ncq dma 32768 out
                          res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
 [111871.309913] ata3.00: status: { DRDY }
 [111871.309914] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309915] ata3.00: cmd 61/40:30:e8:47:4b/00:00:00:00:00/40 tag 6 ncq dma 32768 out
                          res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309918] ata3.00: status: { DRDY }
 [111871.309919] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309919] ata3.00: cmd 61/70:38:48:37:2b/00:00:1c:00:00/40 tag 7 ncq dma 57344 out
                          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309922] ata3.00: status: { DRDY }
 [111871.309923] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309924] ata3.00: cmd 61/20:40:78:29:0c/00:00:19:00:00/40 tag 8 ncq dma 16384 out
                          res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
 [111871.309927] ata3.00: status: { DRDY }
 [111871.309928] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309929] ata3.00: cmd 61/08:48:08:0c:c0/00:00:1c:00:00/40 tag 9 ncq dma 4096 out
                          res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
 [111871.309932] ata3.00: status: { DRDY }
 [111871.309933] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309934] ata3.00: cmd 61/40:50:28:48:4b/00:00:00:00:00/40 tag 10 ncq dma 32768 out
                          res 40/00:01:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
 [111871.309937] ata3.00: status: { DRDY }
 [111871.309938] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309939] ata3.00: cmd 61/40:58:68:48:4b/00:00:00:00:00/40 tag 11 ncq dma 32768 out
                          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309942] ata3.00: status: { DRDY }
 [111871.309943] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309944] ata3.00: cmd 61/40:60:a8:48:4b/00:00:00:00:00/40 tag 12 ncq dma 32768 out
                          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 [111871.309946] ata3.00: status: { DRDY }
 [111871.309947] ata3.00: failed command: WRITE FPDMA QUEUED
 [111871.309948] ata3.00: cmd 61/40:68:e8:48:4b/00:00:00:00:00/40 tag 13 ncq dma 32768 out
                          res 40/00:01:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
 [111871.309951] ata3.00: status: { DRDY }
 [111871.309953] ata3: hard resetting link
 ...
 ...
 ...
 [112789.787310] INFO: task sos:748884 blocked for more than 600 seconds.                                                                                                                                                                                                                                                     
 [112789.787314]       Tainted: G           OE    --------- -  - 4.18.0-372.69.1.rt7.227.el8_6.x86_64 #1                                                                                                                                                                                                                      
 [112789.787316] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.                                                                                                                                                                                                                                    
 [112789.787316] task:sos             state:D stack:    0 pid:748884 ppid:718369 flags:0x00084080                                                                                                                                                                                                                             
 [112789.787320] Call Trace:                                                                                                                                                                                                                                                                                                  
 [112789.787323]  __schedule+0x37b/0x8e0                                                                                                                                                                                                                                                                                      
 [112789.787330]  schedule+0x6c/0x120                                                                                                                                                                                                                                                                                         
 [112789.787333]  schedule_timeout+0x2b7/0x410                                                                                                                                                                                                                                                                                
 [112789.787336]  ? enqueue_entity+0x130/0x790                                                                                                                                                                                                                                                                                
 [112789.787340]  wait_for_completion+0x84/0xf0                                                                                                                                                                                                                                                                               
 [112789.787343]  flush_work+0x120/0x1d0                                                                                                                                                                                                                                                                                      
 [112789.787347]  ? flush_workqueue_prep_pwqs+0x130/0x130                                                                                                                                                                                                                                                                     
 [112789.787350]  schedule_on_each_cpu+0xa7/0xe0                                                                                                                                                                                                                                                                              
 [112789.787353]  vmstat_refresh+0x22/0xa0                                                                                                                                                                                                                                                                                    
 [112789.787357]  proc_sys_call_handler+0x174/0x1d0                                                                                                                                                                                                                                                                           
 [112789.787361]  vfs_read+0x91/0x150                                                                                                                                                                                                                                                                                         
 [112789.787364]  ksys_read+0x52/0xc0                                                                                                                                                                                                                                                                                         
 [112789.787366]  do_syscall_64+0x87/0x1b0                                                                                                                                                                                                                                                                                    
 [112789.787369]  entry_SYSCALL_64_after_hwframe+0x61/0xc6                                                                                                                                                                                                                                                                    
 [112789.787372] RIP: 0033:0x7f2dca8c2ab4                                                                                                                                                                                                                                                                                     
 [112789.787378] Code: Unable to access opcode bytes at RIP 0x7f2dca8c2a8a.                                                                                                                                                                                                                                                   
 [112789.787378] RSP: 002b:00007f2dbbffc5e0 EFLAGS: 00000246 ORIG_RAX: 0000000000000000                                                                                                                                                                                                                                       
 [112789.787380] RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f2dca8c2ab4                                                                                                                                                                                                                                            
 [112789.787382] RDX: 0000000000004000 RSI: 00007f2db402b5a0 RDI: 0000000000000008                                                                                                                                                                                                                                            
 [112789.787383] RBP: 00007f2db402b5a0 R08: 0000000000000000 R09: 00007f2dcace27bb                                                                                                                                                                                                                                            
 [112789.787383] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000004000                                                                                                                                                                                                                                            
 [112789.787384] R13: 0000000000000008 R14: 00007f2db402b5a0 R15: 00007f2da4001a90                                                                                                                                                                                                                                            
 [112789.787418] NMI backtrace for cpu 34    {code}
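 As a cross-check on the reproduction above: sos was pinned with taskset -c 29-31,61-63, while the kernel command line isolates CPUs 3-31,35-63 (isolcpus/nohz_full) and keeps housekeeping on CPUs 0-2,32-34 (systemd.cpu_affinity). A small Python sketch, using only the values quoted in this report, that expands those CPU lists and shows where the sos affinity mask lands:
 {code:none}
 # Sketch: expand the kernel-style CPU lists quoted above and check whether
 # the CPUs sos was pinned to fall inside the isolated set. The values are
 # copied from the logs in this report; nothing is probed on a live system.

 def expand(cpulist: str) -> set[int]:
     """Expand a CPU list such as '3-31,35-63' into a set of CPU numbers."""
     cpus: set[int] = set()
     for part in cpulist.split(","):
         if "-" in part:
             lo, hi = part.split("-")
             cpus.update(range(int(lo), int(hi) + 1))
         else:
             cpus.add(int(part))
     return cpus

 isolated     = expand("3-31,35-63")      # isolcpus / nohz_full from the cmdline
 housekeeping = expand("0,1,2,32,33,34")  # systemd.cpu_affinity from the cmdline
 sos_pinned   = expand("29-31,61-63")     # taskset -c used for the sos run

 print("sos CPUs inside the isolated set:", sorted(sos_pinned & isolated))
 print("sos CPUs on housekeeping CPUs:   ", sorted(sos_pinned & housekeeping))
 {code}
 With these values every CPU in the sos affinity mask is in the isolated set and none is a housekeeping CPU, which is worth keeping in mind alongside the hung-task trace above, where sos blocks waiting in schedule_on_each_cpu()/flush_work().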
Status: CLOSED
#OCPBUGS-32091issue4 weeks agoCAPI-Installer leaks processes during unsuccessful installs MODIFIED
ERROR Attempted to gather debug logs after installation failure: failed to create SSH client: ssh: handshake failed: ssh: disconnect, reason 2: Too many authentication failures
ERROR Attempted to gather ClusterOperator status after installation failure: listing ClusterOperator objects: Get "https://api.gpei-0515.qe.devcluster.openshift.com:6443/apis/config.openshift.io/v1/clusteroperators": dial tcp 3.134.9.157:6443: connect: connection refused
ERROR Bootstrap failed to complete: Get "https://api.gpei-0515.qe.devcluster.openshift.com:6443/version": dial tcp 18.222.8.23:6443: connect: connection refused

... 1 lines not shown

periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-aws-sdn-upgrade (all) - 27 runs, 37% failed, 250% of failures match = 93% impact
#1791553314731593728junit25 hours ago
May 17 21:06:56.592 E ns/openshift-sdn pod/sdn-controller-wpvzc node/ip-10-0-157-78.ec2.internal uid/090d3535-6e03-4189-931a-123b06071ad3 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 17 21:06:57.615 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-157-78.ec2.internal node/ip-10-0-157-78.ec2.internal uid/de852e53-7982-43db-a04b-680e6c17e575 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0517 21:06:56.175054       1 cmd.go:216] Using insecure, self-signed certificates\nI0517 21:06:56.187572       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715980016 cert, and key in /tmp/serving-cert-3151910436/serving-signer.crt, /tmp/serving-cert-3151910436/serving-signer.key\nI0517 21:06:56.875479       1 observer_polling.go:159] Starting file observer\nW0517 21:06:56.898349       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-157-78.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0517 21:06:56.898539       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0517 21:06:56.928725       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3151910436/tls.crt::/tmp/serving-cert-3151910436/tls.key"\nF0517 21:06:57.127894       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 17 21:07:02.968 E ns/openshift-cluster-csi-drivers pod/aws-ebs-csi-driver-node-7hfkf node/ip-10-0-157-78.ec2.internal uid/47776caf-fe46-41b0-9700-457a4c8a2e66 container/csi-node-driver-registrar reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1791553314731593728junit25 hours ago
May 17 21:07:04.081 E ns/openshift-network-diagnostics pod/network-check-target-2pck7 node/ip-10-0-157-78.ec2.internal uid/a33d682d-b977-409c-9450-92989d9121d4 container/network-check-target-container reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 17 21:07:04.127 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-157-78.ec2.internal node/ip-10-0-157-78.ec2.internal uid/de852e53-7982-43db-a04b-680e6c17e575 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0517 21:06:56.175054       1 cmd.go:216] Using insecure, self-signed certificates\nI0517 21:06:56.187572       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715980016 cert, and key in /tmp/serving-cert-3151910436/serving-signer.crt, /tmp/serving-cert-3151910436/serving-signer.key\nI0517 21:06:56.875479       1 observer_polling.go:159] Starting file observer\nW0517 21:06:56.898349       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-157-78.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0517 21:06:56.898539       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0517 21:06:56.928725       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3151910436/tls.crt::/tmp/serving-cert-3151910436/tls.key"\nF0517 21:06:57.127894       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 17 21:07:05.204 E ns/openshift-multus pod/network-metrics-daemon-h66hp node/ip-10-0-157-78.ec2.internal uid/61a3ec0d-cefc-4ae9-98f4-a61424190670 container/network-metrics-daemon reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1791244565240352768junit46 hours ago
May 17 00:41:12.416 E ns/openshift-multus pod/network-metrics-daemon-t75jd node/ip-10-0-162-195.us-west-1.compute.internal uid/27fbc80d-c0a3-4fc2-af8e-e2b39708b682 container/network-metrics-daemon reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 17 00:41:13.393 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-162-195.us-west-1.compute.internal node/ip-10-0-162-195.us-west-1.compute.internal uid/51804a72-f2ce-48d1-b2a9-57043427e3c8 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0517 00:41:11.941248       1 cmd.go:216] Using insecure, self-signed certificates\nI0517 00:41:11.941569       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715906471 cert, and key in /tmp/serving-cert-1669256018/serving-signer.crt, /tmp/serving-cert-1669256018/serving-signer.key\nI0517 00:41:12.172551       1 observer_polling.go:159] Starting file observer\nW0517 00:41:12.234055       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-162-195.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0517 00:41:12.234173       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0517 00:41:12.247326       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1669256018/tls.crt::/tmp/serving-cert-1669256018/tls.key"\nF0517 00:41:12.589600       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 17 00:41:21.380 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-162-195.us-west-1.compute.internal node/ip-10-0-162-195.us-west-1.compute.internal uid/51804a72-f2ce-48d1-b2a9-57043427e3c8 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0517 00:41:11.941248       1 cmd.go:216] Using insecure, self-signed certificates\nI0517 00:41:11.941569       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715906471 cert, and key in /tmp/serving-cert-1669256018/serving-signer.crt, /tmp/serving-cert-1669256018/serving-signer.key\nI0517 00:41:12.172551       1 observer_polling.go:159] Starting file observer\nW0517 00:41:12.234055       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-162-195.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0517 00:41:12.234173       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0517 00:41:12.247326       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1669256018/tls.crt::/tmp/serving-cert-1669256018/tls.key"\nF0517 00:41:12.589600       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1791711138363215872junit15 hours ago
May 18 07:36:54.236 E ns/openshift-monitoring pod/node-exporter-gqgwm node/ip-10-0-204-87.us-west-1.compute.internal uid/5f8f55fc-17e7-41aa-a1c2-5af646cb3f45 container/node-exporter reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 18 07:36:58.862 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-204-87.us-west-1.compute.internal node/ip-10-0-204-87.us-west-1.compute.internal uid/3288de74-dbd5-4df5-900b-5603ec64624e container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0518 07:36:57.439150       1 cmd.go:216] Using insecure, self-signed certificates\nI0518 07:36:57.451288       1 crypto.go:601] Generating new CA for check-endpoints-signer@1716017817 cert, and key in /tmp/serving-cert-4174371775/serving-signer.crt, /tmp/serving-cert-4174371775/serving-signer.key\nI0518 07:36:57.770992       1 observer_polling.go:159] Starting file observer\nW0518 07:36:57.785133       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-204-87.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0518 07:36:57.785343       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0518 07:36:57.799638       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-4174371775/tls.crt::/tmp/serving-cert-4174371775/tls.key"\nF0518 07:36:57.989460       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 18 07:37:01.841 E ns/openshift-dns pod/node-resolver-nkvbt node/ip-10-0-204-87.us-west-1.compute.internal uid/615e08a7-ef1d-4e0f-a825-e158a6cb7e63 container/dns-node-resolver reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 3 lines not shown

#1791424576916295680junit34 hours ago
May 17 12:29:02.016 E ns/openshift-dns pod/dns-default-qnvgt node/ip-10-0-173-87.ec2.internal uid/abf38bd2-2244-439e-a94c-4f6b1361647f container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 17 12:29:02.059 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-173-87.ec2.internal node/ip-10-0-173-87.ec2.internal uid/35394f2f-88f6-4fb6-8d15-c77040f499b6 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0517 12:29:00.844536       1 cmd.go:216] Using insecure, self-signed certificates\nI0517 12:29:00.845062       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715948940 cert, and key in /tmp/serving-cert-3471517965/serving-signer.crt, /tmp/serving-cert-3471517965/serving-signer.key\nI0517 12:29:01.249661       1 observer_polling.go:159] Starting file observer\nW0517 12:29:01.270071       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-173-87.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0517 12:29:01.270290       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0517 12:29:01.279895       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3471517965/tls.crt::/tmp/serving-cert-3471517965/tls.key"\nF0517 12:29:01.811670       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 17 12:29:02.078 E ns/openshift-multus pod/network-metrics-daemon-hxbf8 node/ip-10-0-173-87.ec2.internal uid/aabef455-4574-41d8-a9a7-c4b14560a5bf container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 4 lines not shown

#1791156896313380864junit2 days ago
May 16 18:53:17.733 E ns/openshift-dns pod/dns-default-w5gw6 node/ip-10-0-159-222.us-west-1.compute.internal uid/4bad4527-e265-4578-a65c-139b8e06a8a1 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 16 18:53:25.670 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-159-222.us-west-1.compute.internal node/ip-10-0-159-222.us-west-1.compute.internal uid/1420125e-b751-4c9a-be39-fc54a64af553 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0516 18:53:16.998353       1 cmd.go:216] Using insecure, self-signed certificates\nI0516 18:53:16.998856       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715885596 cert, and key in /tmp/serving-cert-270578083/serving-signer.crt, /tmp/serving-cert-270578083/serving-signer.key\nI0516 18:53:17.292904       1 observer_polling.go:159] Starting file observer\nW0516 18:53:17.340231       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-159-222.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0516 18:53:17.340409       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0516 18:53:17.370131       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-270578083/tls.crt::/tmp/serving-cert-270578083/tls.key"\nW0516 18:53:25.246397       1 requestheader_controller.go:193] Unable to get configmap/extension-apiserver-authentication in kube-system.  Usually fixed by 'kubectl create rolebinding -n kube-system ROLEBINDING_NAME --role=extension-apiserver-authentication-reader --serviceaccount=YOUR_NS:YOUR_SA'\nF0516 18:53:25.246431       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: configmaps "extension-apiserver-authentication" is forbidden: User "system:serviceaccount:openshift-kube-apiserver:check-endpoints" cannot get resource "configmaps" in API group "" in the namespace "kube-system"\n
May 16 18:53:27.834 E ns/openshift-multus pod/multus-additional-cni-plugins-bdj8b node/ip-10-0-159-222.us-west-1.compute.internal uid/c4a8e225-933e-4b52-ad9a-cfef4db96151 container/kube-multus-additional-cni-plugins reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 2 lines not shown

#1791042149224026112junit2 days ago
May 16 11:05:41.290 E ns/openshift-multus pod/network-metrics-daemon-ncpzh node/ip-10-0-190-170.us-west-2.compute.internal uid/de5dcdd7-80bb-4f86-b2e3-84efb6122327 container/network-metrics-daemon reason/ContainerExit code/137 cause/ContainerStatusUnknown The container could not be located when the pod was deleted.  The container used to be Running
May 16 11:05:53.956 E ns/openshift-sdn pod/sdn-controller-22jxp node/ip-10-0-146-20.us-west-2.compute.internal uid/14032a76-1bd1-4883-8530-45ec8d401d8e container/sdn-controller reason/ContainerExit code/2 cause/Error I0516 10:02:30.275152       1 server.go:27] Starting HTTP metrics server\nI0516 10:02:30.275247       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0516 10:09:42.214611       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0516 10:10:23.406894       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.225.169:6443: connect: connection refused\nE0516 10:11:03.415790       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.225.169:6443: connect: connection refused\nE0516 10:15:04.984605       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.225.169:6443: connect: connection refused\nE0516 10:18:31.898218       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.225.169:6443: connect: connection refused\n
May 16 11:05:58.440 E ns/openshift-sdn pod/sdn-jvzv4 node/ip-10-0-204-106.us-west-2.compute.internal uid/7288b5b4-274d-41ed-b001-9c1da260fc8d container/kube-rbac-proxy reason/ContainerExit code/137 cause/ContainerStatusUnknown The container could not be located when the pod was deleted.  The container used to be Running
#1791042149224026112junit2 days ago
May 16 11:06:03.403 E ns/openshift-multus pod/multus-additional-cni-plugins-jmrsw node/ip-10-0-190-170.us-west-2.compute.internal uid/1df73e75-0173-4b70-b99a-1155cf46b984 container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 16 11:06:09.434 E ns/openshift-sdn pod/sdn-controller-fsnss node/ip-10-0-190-170.us-west-2.compute.internal uid/02722aaa-d760-4f52-a4be-6b0a554c11b8 container/sdn-controller reason/ContainerExit code/2 cause/Error I0516 10:02:31.965051       1 server.go:27] Starting HTTP metrics server\nI0516 10:02:31.965153       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0516 10:09:49.471716       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0516 10:10:42.350331       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.225.169:6443: connect: connection refused\nE0516 10:11:21.131343       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.156.72:6443: connect: connection refused\nE0516 10:18:18.346789       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-6wzds64s-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.156.72:6443: connect: connection refused\n
May 16 11:06:13.192 E ns/openshift-multus pod/cni-sysctl-allowlist-ds-bd4jn node/ip-10-0-136-106.us-west-2.compute.internal uid/9a34603d-0fb3-4824-be1b-394430386934 container/kube-multus-additional-cni-plugins reason/ContainerExit code/137 cause/Error
#1790949643945775104junit2 days ago
May 16 05:00:15.431 - 999ms E ns/openshift-console route/console disruption/ingress-to-console connection/new reason/DisruptionBegan ns/openshift-console route/console disruption/ingress-to-console connection/new stopped responding to GET requests over new connections: Get "https://console-openshift-console.apps.ci-op-sgzzxi6p-0e208.aws-2.ci.openshift.org/healthz": read tcp 10.130.91.5:59614->3.136.149.125:443: read: connection reset by peer
May 16 05:00:15.782 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-217-11.us-east-2.compute.internal node/ip-10-0-217-11.us-east-2.compute.internal uid/0a81991d-fd35-41ee-a8c8-1a8cb4ff2ba3 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0516 05:00:13.920969       1 cmd.go:216] Using insecure, self-signed certificates\nI0516 05:00:13.934938       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715835613 cert, and key in /tmp/serving-cert-2411900970/serving-signer.crt, /tmp/serving-cert-2411900970/serving-signer.key\nI0516 05:00:14.432225       1 observer_polling.go:159] Starting file observer\nW0516 05:00:14.469336       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-217-11.us-east-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0516 05:00:14.469484       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0516 05:00:14.483164       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2411900970/tls.crt::/tmp/serving-cert-2411900970/tls.key"\nF0516 05:00:14.798868       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 16 05:00:20.356 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-217-11.us-east-2.compute.internal node/ip-10-0-217-11.us-east-2.compute.internal uid/0a81991d-fd35-41ee-a8c8-1a8cb4ff2ba3 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0516 05:00:13.920969       1 cmd.go:216] Using insecure, self-signed certificates\nI0516 05:00:13.934938       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715835613 cert, and key in /tmp/serving-cert-2411900970/serving-signer.crt, /tmp/serving-cert-2411900970/serving-signer.key\nI0516 05:00:14.432225       1 observer_polling.go:159] Starting file observer\nW0516 05:00:14.469336       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-217-11.us-east-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0516 05:00:14.469484       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0516 05:00:14.483164       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2411900970/tls.crt::/tmp/serving-cert-2411900970/tls.key"\nF0516 05:00:14.798868       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1790738273920880640junit3 days ago
May 15 15:10:33.000 E ns/openshift-monitoring pod/node-exporter-82zbw node/ip-10-0-237-187.us-west-2.compute.internal uid/8814ba71-dbc8-4646-83f1-2408a2891834 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 15 15:10:39.836 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal node/ip-10-0-237-187.us-west-2.compute.internal uid/efa8a8cb-3abf-45a2-990d-b04b00e5adc9 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0515 15:10:38.080796       1 cmd.go:216] Using insecure, self-signed certificates\nI0515 15:10:38.094185       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715785838 cert, and key in /tmp/serving-cert-1901414821/serving-signer.crt, /tmp/serving-cert-1901414821/serving-signer.key\nI0515 15:10:38.498035       1 observer_polling.go:159] Starting file observer\nW0515 15:10:38.562029       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0515 15:10:38.562181       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0515 15:10:38.595631       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1901414821/tls.crt::/tmp/serving-cert-1901414821/tls.key"\nF0515 15:10:38.728616       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 15 15:10:40.064 - 999ms E ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new reason/DisruptionBegan ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new stopped responding to GET requests over new connections: Get "https://oauth-openshift.apps.ci-op-tfgy8ljk-0e208.aws-2.ci.openshift.org/healthz": read tcp 10.130.150.154:45384->54.187.15.100:443: read: connection reset by peer
#1790738273920880640junit3 days ago
May 15 15:10:41.989 E ns/openshift-cluster-csi-drivers pod/aws-ebs-csi-driver-node-9nftr node/ip-10-0-237-187.us-west-2.compute.internal uid/3e3e88b3-2813-47b0-9ae4-da21a9bd9a9b container/csi-node-driver-registrar reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 15 15:10:42.019 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal node/ip-10-0-237-187.us-west-2.compute.internal uid/efa8a8cb-3abf-45a2-990d-b04b00e5adc9 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0515 15:10:38.080796       1 cmd.go:216] Using insecure, self-signed certificates\nI0515 15:10:38.094185       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715785838 cert, and key in /tmp/serving-cert-1901414821/serving-signer.crt, /tmp/serving-cert-1901414821/serving-signer.key\nI0515 15:10:38.498035       1 observer_polling.go:159] Starting file observer\nW0515 15:10:38.562029       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0515 15:10:38.562181       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0515 15:10:38.595631       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1901414821/tls.crt::/tmp/serving-cert-1901414821/tls.key"\nF0515 15:10:38.728616       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 15 15:10:43.014 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal node/ip-10-0-237-187.us-west-2.compute.internal uid/efa8a8cb-3abf-45a2-990d-b04b00e5adc9 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0515 15:10:41.659846       1 cmd.go:216] Using insecure, self-signed certificates\nI0515 15:10:41.660359       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715785841 cert, and key in /tmp/serving-cert-653763215/serving-signer.crt, /tmp/serving-cert-653763215/serving-signer.key\nI0515 15:10:42.115594       1 observer_polling.go:159] Starting file observer\nW0515 15:10:42.119794       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-237-187.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0515 15:10:42.119970       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0515 15:10:42.120511       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-653763215/tls.crt::/tmp/serving-cert-653763215/tls.key"\nF0515 15:10:42.391235       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown
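Every kube-apiserver-check-endpoints exit recorded above follows the same pattern: the check-endpoints container starts while the kube-apiserver on that node is still restarting, every call to https://localhost:6443 is refused, and the process dies at cmd.go:141 with exit code 255. A minimal diagnostic sketch, assuming SSH or "oc debug node/<name>" access to the affected control-plane node (the /readyz path is the standard apiserver readiness endpoint, not something taken from these logs), to measure how long the localhost:6443 outage actually lasts:

{code:bash}
# Hedged sketch: poll the node-local kube-apiserver until it accepts
# connections again, printing a timestamp for every refused attempt.
while true; do
  code=$(curl -ks -o /dev/null -w '%{http_code}' https://localhost:6443/readyz || true)
  if [ "$code" = "200" ]; then
    echo "$(date -u '+%H:%M:%S') localhost:6443 is serving again"
    break
  fi
  echo "$(date -u '+%H:%M:%S') localhost:6443 still refused (http=$code)"
  sleep 2
done
{code}

If that window lines up with a kube-apiserver rollout, the code/255 exits are most likely restart noise rather than an independent failure.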

#1790589768447299584 (junit, 3 days ago)
May 15 05:18:15.939 E ns/openshift-multus pod/multus-additional-cni-plugins-6bknz node/ip-10-0-152-67.us-west-2.compute.internal uid/128308be-7364-4e7d-9a21-5ac2ec17aafb container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 15 05:18:31.367 E ns/openshift-sdn pod/sdn-controller-vc4zb node/ip-10-0-183-38.us-west-2.compute.internal uid/bc402dea-bc6a-4a8d-9c06-9f6f9db423a2 container/sdn-controller reason/ContainerExit code/2 cause/Error I0515 04:16:28.668044       1 server.go:27] Starting HTTP metrics server\nI0515 04:16:28.668132       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0515 04:23:14.755191       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0515 04:30:48.442371       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-svwdnn7q-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.207.210:6443: connect: connection refused\n
May 15 05:18:40.018 E ns/openshift-sdn pod/sdn-s54d6 node/ip-10-0-133-205.us-west-2.compute.internal uid/52862132-c2dc-4241-9f41-722151f6363b container/sdn reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1790589768447299584 (junit, 3 days ago)
May 15 05:18:48.876 E ns/openshift-multus pod/multus-additional-cni-plugins-vkz9c node/ip-10-0-174-240.us-west-2.compute.internal uid/7955723b-edfc-4eba-acf3-cfda2d4cf35e container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 15 05:18:49.687 E ns/openshift-sdn pod/sdn-controller-w9htt node/ip-10-0-222-205.us-west-2.compute.internal uid/65f9f64f-af2c-48e3-a117-42597240914b container/sdn-controller reason/ContainerExit code/2 cause/Error I0515 04:16:28.585533       1 server.go:27] Starting HTTP metrics server\nI0515 04:16:28.585896       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0515 04:22:59.721219       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0515 04:24:33.995053       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-svwdnn7q-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.132.72:6443: connect: connection refused\nE0515 04:27:14.855901       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-svwdnn7q-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.207.210:6443: connect: connection refused\nE0515 04:30:20.697064       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-svwdnn7q-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.207.210:6443: connect: connection refused\nE0515 04:30:59.421252       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-svwdnn7q-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.207.210:6443: connect: connection refused\n
May 15 05:18:50.037 E ns/openshift-network-diagnostics pod/network-check-target-xvfh7 node/ip-10-0-133-205.us-west-2.compute.internal uid/eeee5b02-d1cd-4ace-8e33-2c639eee2460 container/network-check-target-container reason/ContainerExit code/2 cause/Error
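The sdn-controller code/2 exits above are all leader-election failures: the controller cannot refresh its openshift-sdn/openshift-network-controller lock while api-int is unreachable, and eventually terminates. Once the API answers again, a quick way to see which controller currently holds that lock (a sketch assuming cluster-admin access; the lock is stored on a ConfigMap, as the log messages themselves indicate):

{code:bash}
# Hedged sketch: show the leader-election record the sdn-controller keeps
# failing to retrieve. The holder identity lives in the well-known
# control-plane.alpha.kubernetes.io/leader annotation on the ConfigMap.
oc -n openshift-sdn get configmap openshift-network-controller -o yaml \
  | grep 'control-plane.alpha.kubernetes.io/leader'
{code}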
#1789948963374239744 (junit, 5 days ago)
May 13 10:56:40.167 E ns/openshift-monitoring pod/node-exporter-w684x node/ip-10-0-176-169.us-west-1.compute.internal uid/b6532fb3-d89d-4576-97ef-e25c4fa95fbf container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 13 10:56:44.896 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-176-169.us-west-1.compute.internal node/ip-10-0-176-169.us-west-1.compute.internal uid/e75919a3-1ce9-4d87-802c-9f55884ce22b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0513 10:56:43.732815       1 cmd.go:216] Using insecure, self-signed certificates\nI0513 10:56:43.736677       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715597803 cert, and key in /tmp/serving-cert-4276888294/serving-signer.crt, /tmp/serving-cert-4276888294/serving-signer.key\nI0513 10:56:44.213793       1 observer_polling.go:159] Starting file observer\nW0513 10:56:44.236167       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-176-169.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0513 10:56:44.236325       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0513 10:56:44.251520       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-4276888294/tls.crt::/tmp/serving-cert-4276888294/tls.key"\nF0513 10:56:44.544134       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 13 10:56:45.901 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-176-169.us-west-1.compute.internal node/ip-10-0-176-169.us-west-1.compute.internal uid/e75919a3-1ce9-4d87-802c-9f55884ce22b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0513 10:56:43.732815       1 cmd.go:216] Using insecure, self-signed certificates\nI0513 10:56:43.736677       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715597803 cert, and key in /tmp/serving-cert-4276888294/serving-signer.crt, /tmp/serving-cert-4276888294/serving-signer.key\nI0513 10:56:44.213793       1 observer_polling.go:159] Starting file observer\nW0513 10:56:44.236167       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-176-169.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0513 10:56:44.236325       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0513 10:56:44.251520       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-4276888294/tls.crt::/tmp/serving-cert-4276888294/tls.key"\nF0513 10:56:44.544134       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 2 lines not shown

#1790317739466821632 (junit, 4 days ago)
May 14 11:05:35.549 E ns/openshift-multus pod/multus-additional-cni-plugins-bzv25 node/ip-10-0-143-126.us-west-2.compute.internal uid/9cb5b595-1dc2-42db-856f-7228213cdb49 container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 14 11:05:51.766 E ns/openshift-sdn pod/sdn-controller-jr44t node/ip-10-0-143-126.us-west-2.compute.internal uid/db142159-f638-4df3-8c1e-a13c1ed5d5f8 container/sdn-controller reason/ContainerExit code/2 cause/Error I0514 10:01:22.897412       1 server.go:27] Starting HTTP metrics server\nI0514 10:01:22.897500       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0514 10:10:10.563803       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0514 10:11:31.927985       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: leases.coordination.k8s.io "openshift-network-controller" is forbidden: User "system:serviceaccount:openshift-sdn:sdn-controller" cannot get resource "leases" in API group "coordination.k8s.io" in the namespace "openshift-sdn": RBAC: [role.rbac.authorization.k8s.io "openshift-sdn-controller-leaderelection" not found, clusterrole.rbac.authorization.k8s.io "system:image-puller" not found]\nE0514 10:19:11.337318       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-0lvkzkfx-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.131.129:6443: connect: connection refused\n
May 14 11:05:59.727 E ns/openshift-network-diagnostics pod/network-check-target-wxsvh node/ip-10-0-182-146.us-west-2.compute.internal uid/e674637f-9869-45df-98ec-862f1f276b0c container/network-check-target-container reason/ContainerExit code/2 cause/Error
#1790317739466821632 (junit, 4 days ago)
May 14 11:06:17.805 E ns/openshift-multus pod/multus-additional-cni-plugins-ndr4r node/ip-10-0-232-149.us-west-2.compute.internal uid/5a822c2a-fa91-4c91-9406-1df8c174794c container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 14 11:06:25.012 E ns/openshift-sdn pod/sdn-controller-7c4mb node/ip-10-0-182-146.us-west-2.compute.internal uid/3c993168-8d76-496d-9408-627df0a88d4b container/sdn-controller reason/ContainerExit code/2 cause/Error I0514 10:01:23.531953       1 server.go:27] Starting HTTP metrics server\nI0514 10:01:23.532044       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0514 10:09:33.238149       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0514 10:18:47.763503       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-0lvkzkfx-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.131.129:6443: connect: connection refused\nE0514 10:19:16.758408       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-0lvkzkfx-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.131.129:6443: connect: connection refused\n
May 14 11:06:32.130 E ns/openshift-multus pod/multus-admission-controller-7c5f5dbb5b-rpsc8 node/ip-10-0-232-149.us-west-2.compute.internal uid/17c47896-a2e7-41ad-899f-5f312bca1f44 container/multus-admission-controller reason/ContainerExit code/137 cause/Error
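The code/143 and code/137 container exits scattered through these runs are plain signal arithmetic: 143 = 128 + 15 (SIGTERM) and 137 = 128 + 9 (SIGKILL), i.e. the multus and admission-controller containers were stopped or force-killed during the rollout rather than crashing on their own. Bash can do the translation directly:

{code:bash}
# Translate a container exit code back to the signal that produced it
# (the bash kill builtin accepts an exit status above 128).
kill -l 143   # TERM -> the container was asked to stop
kill -l 137   # KILL -> the container was force-killed (kubelet or OOM)
{code}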
#1790408687643267072 (junit, 4 days ago)
May 14 17:11:42.455 E ns/openshift-sdn pod/sdn-controller-b7rzz node/ip-10-0-130-5.us-east-2.compute.internal uid/ffbed142-e910-4947-9926-ccdaec8ff6a3 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 14 17:11:43.479 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-130-5.us-east-2.compute.internal node/ip-10-0-130-5.us-east-2.compute.internal uid/639afad0-ce42-45ed-91e2-1fd67b842382 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0514 17:11:42.106427       1 cmd.go:216] Using insecure, self-signed certificates\nI0514 17:11:42.107191       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715706702 cert, and key in /tmp/serving-cert-3885880230/serving-signer.crt, /tmp/serving-cert-3885880230/serving-signer.key\nI0514 17:11:42.687285       1 observer_polling.go:159] Starting file observer\nW0514 17:11:42.712631       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-130-5.us-east-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0514 17:11:42.712771       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0514 17:11:42.754166       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3885880230/tls.crt::/tmp/serving-cert-3885880230/tls.key"\nF0514 17:11:43.118908       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 14 17:11:44.531 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-130-5.us-east-2.compute.internal node/ip-10-0-130-5.us-east-2.compute.internal uid/639afad0-ce42-45ed-91e2-1fd67b842382 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0514 17:11:42.106427       1 cmd.go:216] Using insecure, self-signed certificates\nI0514 17:11:42.107191       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715706702 cert, and key in /tmp/serving-cert-3885880230/serving-signer.crt, /tmp/serving-cert-3885880230/serving-signer.key\nI0514 17:11:42.687285       1 observer_polling.go:159] Starting file observer\nW0514 17:11:42.712631       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-130-5.us-east-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0514 17:11:42.712771       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0514 17:11:42.754166       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3885880230/tls.crt::/tmp/serving-cert-3885880230/tls.key"\nF0514 17:11:43.118908       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1790039506817126400 (junit, 5 days ago)
May 13 16:54:56.787 E ns/openshift-multus pod/multus-psr5f node/ip-10-0-153-11.us-west-2.compute.internal uid/4bdd20e0-5e69-47f2-b105-46c939d7714b container/kube-multus reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 13 16:55:02.496 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-153-11.us-west-2.compute.internal node/ip-10-0-153-11.us-west-2.compute.internal uid/98d973c7-4b43-4ae7-94de-66aff65b8b1b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0513 16:55:01.082563       1 cmd.go:216] Using insecure, self-signed certificates\nI0513 16:55:01.082906       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715619301 cert, and key in /tmp/serving-cert-2241620786/serving-signer.crt, /tmp/serving-cert-2241620786/serving-signer.key\nI0513 16:55:01.330850       1 observer_polling.go:159] Starting file observer\nW0513 16:55:01.344729       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-153-11.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0513 16:55:01.344871       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0513 16:55:01.351668       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2241620786/tls.crt::/tmp/serving-cert-2241620786/tls.key"\nF0513 16:55:01.519272       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 13 16:55:03.545 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-153-11.us-west-2.compute.internal node/ip-10-0-153-11.us-west-2.compute.internal uid/98d973c7-4b43-4ae7-94de-66aff65b8b1b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0513 16:55:01.082563       1 cmd.go:216] Using insecure, self-signed certificates\nI0513 16:55:01.082906       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715619301 cert, and key in /tmp/serving-cert-2241620786/serving-signer.crt, /tmp/serving-cert-2241620786/serving-signer.key\nI0513 16:55:01.330850       1 observer_polling.go:159] Starting file observer\nW0513 16:55:01.344729       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-153-11.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0513 16:55:01.344871       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0513 16:55:01.351668       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2241620786/tls.crt::/tmp/serving-cert-2241620786/tls.key"\nF0513 16:55:01.519272       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1788946036887130112 (junit, 8 days ago)
May 10 16:20:34.379 E ns/openshift-cluster-csi-drivers pod/aws-ebs-csi-driver-node-kq8bv node/ip-10-0-202-194.ec2.internal uid/0f68d5ce-7c7b-4530-9ff7-76840ee41106 container/csi-node-driver-registrar reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 10 16:20:36.232 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-202-194.ec2.internal node/ip-10-0-202-194.ec2.internal uid/df235032-f4b8-49f8-94fd-82dfb5a62115 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0510 16:20:34.345879       1 cmd.go:216] Using insecure, self-signed certificates\nI0510 16:20:34.357221       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715358034 cert, and key in /tmp/serving-cert-1567204266/serving-signer.crt, /tmp/serving-cert-1567204266/serving-signer.key\nI0510 16:20:35.184819       1 observer_polling.go:159] Starting file observer\nW0510 16:20:35.203031       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-202-194.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0510 16:20:35.203156       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0510 16:20:35.230198       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1567204266/tls.crt::/tmp/serving-cert-1567204266/tls.key"\nF0510 16:20:35.505607       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 10 16:20:42.895 E ns/openshift-network-diagnostics pod/network-check-target-fr8l8 node/ip-10-0-202-194.ec2.internal uid/8e5bedfe-81a0-46b2-8eb5-f31ffaaed1bc container/network-check-target-container reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 2 lines not shown

#1788848856033660928 (junit, 8 days ago)
May 10 10:21:08.082 E ns/openshift-sdn pod/sdn-controller-sskm9 node/ip-10-0-248-85.ec2.internal uid/41bcd6cb-1df1-4803-b378-385a42fcceaa container/sdn-controller reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 10 10:21:09.104 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-248-85.ec2.internal node/ip-10-0-248-85.ec2.internal uid/63e8a4f7-ac9a-4afe-9bf3-75b37034dcfe container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0510 10:21:08.001577       1 cmd.go:216] Using insecure, self-signed certificates\nI0510 10:21:08.001963       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715336468 cert, and key in /tmp/serving-cert-3297632347/serving-signer.crt, /tmp/serving-cert-3297632347/serving-signer.key\nI0510 10:21:08.455077       1 observer_polling.go:159] Starting file observer\nW0510 10:21:08.465312       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-248-85.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0510 10:21:08.465482       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0510 10:21:08.479795       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3297632347/tls.crt::/tmp/serving-cert-3297632347/tls.key"\nF0510 10:21:08.814674       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 10 10:21:13.244 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-248-85.ec2.internal node/ip-10-0-248-85.ec2.internal uid/63e8a4f7-ac9a-4afe-9bf3-75b37034dcfe container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0510 10:21:08.001577       1 cmd.go:216] Using insecure, self-signed certificates\nI0510 10:21:08.001963       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715336468 cert, and key in /tmp/serving-cert-3297632347/serving-signer.crt, /tmp/serving-cert-3297632347/serving-signer.key\nI0510 10:21:08.455077       1 observer_polling.go:159] Starting file observer\nW0510 10:21:08.465312       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-248-85.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0510 10:21:08.465482       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0510 10:21:08.479795       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3297632347/tls.crt::/tmp/serving-cert-3297632347/tls.key"\nF0510 10:21:08.814674       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1788556207703724032 (junit, 9 days ago)
May 09 14:29:35.184 E ns/openshift-multus pod/cni-sysctl-allowlist-ds-h5hlc node/ip-10-0-255-216.us-west-2.compute.internal uid/c222eb18-248e-48ad-b1cc-6d8a4254207a container/kube-multus-additional-cni-plugins reason/ContainerExit code/137 cause/Error
May 09 14:29:37.381 E ns/openshift-sdn pod/sdn-controller-bbbfm node/ip-10-0-206-134.us-west-2.compute.internal uid/c963f6c4-0991-48b0-ac3d-fcb80706da05 container/sdn-controller reason/ContainerExit code/2 cause/Error I0509 13:21:07.211462       1 server.go:27] Starting HTTP metrics server\nI0509 13:21:07.211563       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0509 13:30:49.928153       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0509 13:31:39.715927       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-qpth2zw8-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.237.170:6443: connect: connection refused\nE0509 13:41:01.519186       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0509 13:41:41.741100       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-qpth2zw8-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.237.170:6443: connect: connection refused\n
May 09 14:29:40.086 E ns/openshift-sdn pod/sdn-wl7wj node/ip-10-0-138-20.us-west-2.compute.internal uid/2b08b5fd-263d-477e-978d-0883514792ce container/sdn reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
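The TerminationStateCleared events here and in the runs below are the known status-wiping issue tracked in bug 1933760: a container's lastState.terminated record vanishes from pod status after a restart. A sketch of how one might confirm it against a live pod, using the pod and container named in the line above (identifiers are specific to that CI run):

{code:bash}
# Hedged sketch: dump the container's lastState; an empty {} after a known
# restart is the symptom the monitor flags as TerminationStateCleared.
oc -n openshift-sdn get pod sdn-wl7wj \
  -o jsonpath='{.status.containerStatuses[?(@.name=="sdn")].lastState}{"\n"}'
{code}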
#1788556207703724032 (junit, 9 days ago)
May 09 14:43:51.518 E ns/openshift-dns pod/node-resolver-2xxb5 node/ip-10-0-206-134.us-west-2.compute.internal uid/281036e3-3946-4029-ba7f-a0f5ff7ea6df container/dns-node-resolver reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 09 14:43:55.546 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-206-134.us-west-2.compute.internal node/ip-10-0-206-134.us-west-2.compute.internal uid/efaad8dd-22b8-4c9d-aaa1-17db13418d8a container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0509 14:43:53.542021       1 cmd.go:216] Using insecure, self-signed certificates\nI0509 14:43:53.559961       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715265833 cert, and key in /tmp/serving-cert-4261999365/serving-signer.crt, /tmp/serving-cert-4261999365/serving-signer.key\nI0509 14:43:53.931996       1 observer_polling.go:159] Starting file observer\nW0509 14:43:53.958669       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-206-134.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0509 14:43:53.958899       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0509 14:43:53.972847       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-4261999365/tls.crt::/tmp/serving-cert-4261999365/tls.key"\nF0509 14:43:54.433277       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 09 14:43:56.964 - 1s    E ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new reason/DisruptionBegan ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new stopped responding to GET requests over new connections: Get "https://oauth-openshift.apps.ci-op-qpth2zw8-0e208.aws-2.ci.openshift.org/healthz": read tcp 10.129.14.183:49452->44.231.145.254:443: read: connection reset by peer
#1788371003525566464 (junit, 9 days ago)
May 09 02:11:08.924 E ns/openshift-multus pod/multus-additional-cni-plugins-nf9h7 node/ip-10-0-169-75.us-west-1.compute.internal uid/d6163adf-0770-4a4b-93c5-2d5aae56b34d container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 09 02:11:24.506 E ns/openshift-sdn pod/sdn-controller-mkxpj node/ip-10-0-244-155.us-west-1.compute.internal uid/df3c2ba0-3b75-43d5-ae05-792e23c1174b container/sdn-controller reason/ContainerExit code/2 cause/Error I0509 01:14:58.642618       1 server.go:27] Starting HTTP metrics server\nI0509 01:14:58.642731       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0509 01:14:58.646969       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-hdgwnjrr-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.203.149:6443: connect: connection refused\nE0509 01:15:39.406785       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-hdgwnjrr-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.203.149:6443: connect: connection refused\nE0509 01:16:30.999673       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-hdgwnjrr-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.170.237:6443: connect: connection refused\n
May 09 02:11:24.506 E ns/openshift-sdn pod/sdn-controller-mkxpj node/ip-10-0-244-155.us-west-1.compute.internal uid/df3c2ba0-3b75-43d5-ae05-792e23c1174b container/sdn-controller reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1788371003525566464 (junit, 9 days ago)
May 09 02:25:23.723 E ns/openshift-image-registry pod/node-ca-mwwjb node/ip-10-0-149-116.us-west-1.compute.internal uid/8e161bc4-b267-4cea-84ac-cfc47daa0b2f container/node-ca reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 09 02:25:26.561 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-149-116.us-west-1.compute.internal node/ip-10-0-149-116.us-west-1.compute.internal uid/37985911-02ea-413b-aa88-a84805e2a97f container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0509 02:25:25.109774       1 cmd.go:216] Using insecure, self-signed certificates\nI0509 02:25:25.124449       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715221525 cert, and key in /tmp/serving-cert-2476285636/serving-signer.crt, /tmp/serving-cert-2476285636/serving-signer.key\nI0509 02:25:25.384811       1 observer_polling.go:159] Starting file observer\nW0509 02:25:25.392878       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-149-116.us-west-1.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0509 02:25:25.393122       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0509 02:25:25.410279       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2476285636/tls.crt::/tmp/serving-cert-2476285636/tls.key"\nF0509 02:25:25.944166       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 09 02:25:29.563 E ns/openshift-dns pod/node-resolver-s8gvh node/ip-10-0-149-116.us-west-1.compute.internal uid/b53bf7d0-96c9-46c2-8866-5026c734fafd container/dns-node-resolver reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1788695372336467968 (junit, 8 days ago)
May 09 23:48:47.177 E ns/openshift-multus pod/network-metrics-daemon-rbj7b node/ip-10-0-173-182.us-west-2.compute.internal uid/b961567a-5a2f-4bdc-bd98-c6e653cb5e53 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 09 23:48:47.206 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-173-182.us-west-2.compute.internal node/ip-10-0-173-182.us-west-2.compute.internal uid/13f7947d-e34d-4478-a75e-dd2027c4cd44 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0509 23:48:44.729248       1 cmd.go:216] Using insecure, self-signed certificates\nI0509 23:48:44.747573       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715298524 cert, and key in /tmp/serving-cert-2516282367/serving-signer.crt, /tmp/serving-cert-2516282367/serving-signer.key\nI0509 23:48:45.702526       1 observer_polling.go:159] Starting file observer\nW0509 23:48:45.714392       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-173-182.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0509 23:48:45.714584       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0509 23:48:45.730270       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2516282367/tls.crt::/tmp/serving-cert-2516282367/tls.key"\nF0509 23:48:46.006288       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 09 23:48:47.252 E ns/openshift-dns pod/dns-default-mkznz node/ip-10-0-173-182.us-west-2.compute.internal uid/1fa558e2-8ef1-4f3e-8ab4-22644eb109f6 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 3 lines not shown

#1788185400976609280 (junit, 10 days ago)
May 08 13:48:54.740 E ns/openshift-multus pod/multus-additional-cni-plugins-pr77q node/ip-10-0-240-218.us-west-2.compute.internal uid/02890180-de25-433a-8654-437adf3a33a1 container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 08 13:48:58.733 E ns/openshift-sdn pod/sdn-controller-f7plz node/ip-10-0-240-218.us-west-2.compute.internal uid/94863e1c-8179-4638-8de2-3268776c883c container/sdn-controller reason/ContainerExit code/2 cause/Error I0508 12:48:28.331480       1 server.go:27] Starting HTTP metrics server\nI0508 12:48:28.331576       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0508 12:58:21.095640       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-3gkqiid4-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.185.158:6443: connect: connection refused\n
May 08 13:49:13.081 E ns/openshift-multus pod/multus-admission-controller-7d69479cf8-rqwjg node/ip-10-0-148-94.us-west-2.compute.internal uid/8afb519c-9392-40bb-8403-6309baf70121 container/multus-admission-controller reason/ContainerExit code/137 cause/Error
#1788185400976609280 (junit, 10 days ago)
May 08 14:03:07.472 E ns/openshift-dns pod/node-resolver-n2pxm node/ip-10-0-240-218.us-west-2.compute.internal uid/7aa374f3-c0d2-4bc8-8c8e-4569de3d9152 container/dns-node-resolver reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 08 14:03:11.151 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-240-218.us-west-2.compute.internal node/ip-10-0-240-218.us-west-2.compute.internal uid/fc9ebc9f-6d75-4c9b-9fed-ad4e23abb924 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0508 14:03:09.642273       1 cmd.go:216] Using insecure, self-signed certificates\nI0508 14:03:09.656247       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715176989 cert, and key in /tmp/serving-cert-2385254311/serving-signer.crt, /tmp/serving-cert-2385254311/serving-signer.key\nI0508 14:03:10.425517       1 observer_polling.go:159] Starting file observer\nW0508 14:03:10.440769       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-240-218.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0508 14:03:10.440894       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0508 14:03:10.452227       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2385254311/tls.crt::/tmp/serving-cert-2385254311/tls.key"\nF0508 14:03:10.780415       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 08 14:03:13.395 - 1s    E ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new reason/DisruptionBegan ns/openshift-authentication route/oauth-openshift disruption/ingress-to-oauth-server connection/new stopped responding to GET requests over new connections: Get "https://oauth-openshift.apps.ci-op-3gkqiid4-0e208.aws-2.ci.openshift.org/healthz": read tcp 10.129.20.133:42006->35.155.100.245:443: read: connection reset by peer
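The ingress-to-oauth-server disruption entries report that GETs over new connections to the oauth route's /healthz briefly failed (connection reset) while the control plane rolled. A sketch of the equivalent manual probe, one fresh connection per request like the connection/new monitor uses; replace <apps-domain> with the cluster's ingress domain, since the hosts in these logs belong to throwaway CI clusters:

{code:bash}
# Hedged sketch: repeatedly GET the oauth route health endpoint, opening a
# new connection each time, to catch the brief resets the monitor reports.
for i in $(seq 1 30); do
  curl -ks -o /dev/null -w "$(date -u '+%H:%M:%S') http=%{http_code}\n" \
    "https://oauth-openshift.apps.<apps-domain>/healthz"
  sleep 1
done
{code}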
#1788094773521813504 (junit, 10 days ago)
May 08 07:48:56.865 E ns/openshift-multus pod/multus-additional-cni-plugins-jjpz8 node/ip-10-0-230-178.ec2.internal uid/5257fe37-bfb9-48b0-a7d4-f52b4ff821af container/kube-multus-additional-cni-plugins reason/ContainerExit code/143 cause/Error
May 08 07:49:11.851 E ns/openshift-sdn pod/sdn-controller-4wjj7 node/ip-10-0-182-15.ec2.internal uid/991b7761-e7a9-4e06-9c36-264ef653cade container/sdn-controller reason/ContainerExit code/2 cause/Error I0508 06:49:19.488350       1 server.go:27] Starting HTTP metrics server\nI0508 06:49:19.488451       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0508 06:59:00.115074       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-b3vjcyvh-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.138.148:6443: connect: connection refused\nE0508 06:59:33.333953       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-b3vjcyvh-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.138.148:6443: connect: connection refused\n
May 08 07:49:19.799 E ns/openshift-network-diagnostics pod/network-check-target-n8kkv node/ip-10-0-180-5.ec2.internal uid/282bf839-a303-4503-8576-b16616ed7abf container/network-check-target-container reason/ContainerExit code/2 cause/Error

... 3 lines not shown

#1787997185929908224 (junit, 10 days ago)
May 08 01:37:09.067 E ns/openshift-monitoring pod/node-exporter-vpdc4 node/ip-10-0-140-134.us-west-2.compute.internal uid/8c52e033-df2a-4a57-9b87-dbdd81d66ee5 container/node-exporter reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 08 01:37:13.227 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal node/ip-10-0-140-134.us-west-2.compute.internal uid/62faaa7b-9f7e-4fb2-915c-f9c98ab67df5 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0508 01:37:11.934008       1 cmd.go:216] Using insecure, self-signed certificates\nI0508 01:37:11.934516       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715132231 cert, and key in /tmp/serving-cert-3712268509/serving-signer.crt, /tmp/serving-cert-3712268509/serving-signer.key\nI0508 01:37:12.271470       1 observer_polling.go:159] Starting file observer\nW0508 01:37:12.279792       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0508 01:37:12.279922       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0508 01:37:12.288418       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3712268509/tls.crt::/tmp/serving-cert-3712268509/tls.key"\nF0508 01:37:12.536266       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 08 01:37:16.287 E ns/e2e-k8s-sig-apps-daemonset-upgrade-3564 pod/ds1-5lnzf node/ip-10-0-140-134.us-west-2.compute.internal uid/9a6e8368-f28f-4325-b739-e007f9b11658 container/app reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
#1787997185929908224 (junit, 10 days ago)
May 08 01:37:17.345 E ns/openshift-multus pod/network-metrics-daemon-v2lr2 node/ip-10-0-140-134.us-west-2.compute.internal uid/a7c50161-70b8-4001-9a0c-b96246403e43 container/kube-rbac-proxy reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 08 01:37:17.423 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal node/ip-10-0-140-134.us-west-2.compute.internal uid/62faaa7b-9f7e-4fb2-915c-f9c98ab67df5 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0508 01:37:11.934008       1 cmd.go:216] Using insecure, self-signed certificates\nI0508 01:37:11.934516       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715132231 cert, and key in /tmp/serving-cert-3712268509/serving-signer.crt, /tmp/serving-cert-3712268509/serving-signer.key\nI0508 01:37:12.271470       1 observer_polling.go:159] Starting file observer\nW0508 01:37:12.279792       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0508 01:37:12.279922       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0508 01:37:12.288418       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3712268509/tls.crt::/tmp/serving-cert-3712268509/tls.key"\nF0508 01:37:12.536266       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 08 01:37:18.312 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal node/ip-10-0-140-134.us-west-2.compute.internal uid/62faaa7b-9f7e-4fb2-915c-f9c98ab67df5 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0508 01:37:15.973760       1 cmd.go:216] Using insecure, self-signed certificates\nI0508 01:37:16.051604       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715132236 cert, and key in /tmp/serving-cert-3047924736/serving-signer.crt, /tmp/serving-cert-3047924736/serving-signer.key\nI0508 01:37:17.072449       1 observer_polling.go:159] Starting file observer\nW0508 01:37:17.074114       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-140-134.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0508 01:37:17.074255       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0508 01:37:17.074722       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3047924736/tls.crt::/tmp/serving-cert-3047924736/tls.key"\nF0508 01:37:17.310032       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1787870660417032192 (junit, 11 days ago)
May 07 17:08:47.312 E ns/openshift-cluster-csi-drivers pod/aws-ebs-csi-driver-node-nmntx node/ip-10-0-164-45.ec2.internal uid/a5777136-51ea-4183-8973-fcdb0b991d5b container/csi-node-driver-registrar reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 07 17:08:52.051 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-164-45.ec2.internal node/ip-10-0-164-45.ec2.internal uid/0eaa3717-85b9-47b7-9307-dbf76549ab48 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0507 17:08:50.680008       1 cmd.go:216] Using insecure, self-signed certificates\nI0507 17:08:50.680515       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715101730 cert, and key in /tmp/serving-cert-2110908182/serving-signer.crt, /tmp/serving-cert-2110908182/serving-signer.key\nI0507 17:08:51.366163       1 observer_polling.go:159] Starting file observer\nW0507 17:08:51.387334       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-164-45.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0507 17:08:51.387577       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0507 17:08:51.396097       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-2110908182/tls.crt::/tmp/serving-cert-2110908182/tls.key"\nF0507 17:08:51.782898       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 07 17:08:52.064 E ns/e2e-k8s-sig-apps-daemonset-upgrade-2892 pod/ds1-trh5n node/ip-10-0-164-45.ec2.internal uid/e3711f9b-4dce-470d-b367-2a1336ca46d9 container/app reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 4 lines not shown

#1787768249904009216 (junit, 11 days ago)
May 07 10:24:45.964 E ns/openshift-sdn pod/sdn-controller-hbfkc node/ip-10-0-161-112.ec2.internal uid/e16ae890-2e5a-493a-a3ca-089451dc252e container/sdn-controller reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 07 10:24:49.936 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-161-112.ec2.internal node/ip-10-0-161-112.ec2.internal uid/7af3c5f7-0d24-4a8f-8ba4-86f52476ea3b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0507 10:24:48.787906       1 cmd.go:216] Using insecure, self-signed certificates\nI0507 10:24:48.793739       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715077488 cert, and key in /tmp/serving-cert-1712410536/serving-signer.crt, /tmp/serving-cert-1712410536/serving-signer.key\nI0507 10:24:49.139902       1 observer_polling.go:159] Starting file observer\nW0507 10:24:49.162615       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-161-112.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0507 10:24:49.162749       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0507 10:24:49.198841       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1712410536/tls.crt::/tmp/serving-cert-1712410536/tls.key"\nF0507 10:24:49.572275       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 07 10:24:50.959 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-161-112.ec2.internal node/ip-10-0-161-112.ec2.internal uid/7af3c5f7-0d24-4a8f-8ba4-86f52476ea3b container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0507 10:24:48.787906       1 cmd.go:216] Using insecure, self-signed certificates\nI0507 10:24:48.793739       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715077488 cert, and key in /tmp/serving-cert-1712410536/serving-signer.crt, /tmp/serving-cert-1712410536/serving-signer.key\nI0507 10:24:49.139902       1 observer_polling.go:159] Starting file observer\nW0507 10:24:49.162615       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-161-112.ec2.internal": dial tcp [::1]:6443: connect: connection refused\nI0507 10:24:49.162749       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0507 10:24:49.198841       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-1712410536/tls.crt::/tmp/serving-cert-1712410536/tls.key"\nF0507 10:24:49.572275       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n

... 1 lines not shown

#1787466113081151488 (junit, 12 days ago)
May 06 14:26:34.975 E ns/openshift-sdn pod/sdn-controller-sgxmh node/ip-10-0-148-35.us-west-2.compute.internal uid/d5a45725-1eac-4dfd-b4fb-f092b09c6e46 container/sdn-controller reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)
May 06 14:26:38.001 E ns/openshift-kube-apiserver pod/kube-apiserver-ip-10-0-148-35.us-west-2.compute.internal node/ip-10-0-148-35.us-west-2.compute.internal uid/faef3c4d-31fe-41f6-a38a-17e0f4dbe881 container/kube-apiserver-check-endpoints reason/ContainerExit code/255 cause/Error W0506 14:26:37.112050       1 cmd.go:216] Using insecure, self-signed certificates\nI0506 14:26:37.130314       1 crypto.go:601] Generating new CA for check-endpoints-signer@1715005597 cert, and key in /tmp/serving-cert-3670388039/serving-signer.crt, /tmp/serving-cert-3670388039/serving-signer.key\nI0506 14:26:37.412124       1 observer_polling.go:159] Starting file observer\nW0506 14:26:37.445214       1 builder.go:239] unable to get owner reference (falling back to namespace): Get "https://localhost:6443/api/v1/namespaces/openshift-kube-apiserver/pods/kube-apiserver-ip-10-0-148-35.us-west-2.compute.internal": dial tcp [::1]:6443: connect: connection refused\nI0506 14:26:37.445525       1 builder.go:271] check-endpoints version v4.0.0-alpha.0-1810-g4d70179-4d7017904\nI0506 14:26:37.463288       1 dynamic_serving_content.go:113] "Loaded a new cert/key pair" name="serving-cert::/tmp/serving-cert-3670388039/tls.crt::/tmp/serving-cert-3670388039/tls.key"\nF0506 14:26:37.842858       1 cmd.go:141] error initializing delegating authentication: unable to load configmap based request-header-client-ca-file: Get "https://localhost:6443/api/v1/namespaces/kube-system/configmaps/extension-apiserver-authentication": dial tcp [::1]:6443: connect: connection refused\n
May 06 14:26:43.044 E ns/e2e-k8s-sig-apps-daemonset-upgrade-2974 pod/ds1-glc2k node/ip-10-0-148-35.us-west-2.compute.internal uid/cfd9fa06-57a4-403f-9915-cc781504d04e container/app reason/TerminationStateCleared lastState.terminated was cleared on a pod (bug https://bugzilla.redhat.com/show_bug.cgi?id=1933760 or similar)

... 2 lines not shown

#1787564745973305344 (junit, 12 days ago)
May 06 20:44:34.568 E ns/openshift-multus pod/multus-sk4d8 node/ip-10-0-209-207.us-west-1.compute.internal uid/28e6a135-ab23-4cd0-804f-7d70ecce6c00 container/kube-multus reason/ContainerExit code/137 cause/ContainerStatusUnknown The container could not be located when the pod was deleted.  The container used to be Running
May 06 20:44:39.722 E ns/openshift-sdn pod/sdn-controller-gsqcb node/ip-10-0-237-187.us-west-1.compute.internal uid/334b17a0-0bc2-4bc6-92f9-1c5f0939b667 container/sdn-controller reason/ContainerExit code/2 cause/Error I0506 19:42:25.563719       1 server.go:27] Starting HTTP metrics server\nI0506 19:42:25.563817       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0506 19:49:28.781975       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0506 19:49:59.551678       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.254.243:6443: connect: connection refused\nE0506 19:53:10.473443       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.254.243:6443: connect: connection refused\nE0506 19:58:55.187152       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.128.253:6443: connect: connection refused\n
May 06 20:44:48.757 E ns/openshift-network-diagnostics pod/network-check-target-swwq5 node/ip-10-0-174-110.us-west-1.compute.internal uid/13c1d6af-5a04-49cb-835c-37e5e65b4b68 container/network-check-target-container reason/ContainerExit code/2 cause/Error
#1787564745973305344 (junit, 12 days ago)
May 06 20:45:00.000 - 1s    E ns/openshift-image-registry route/test-disruption-new disruption/image-registry connection/new reason/DisruptionBegan ns/openshift-image-registry route/test-disruption-new disruption/image-registry connection/new stopped responding to GET requests over new connections: Get "https://test-disruption-new-openshift-image-registry.apps.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org/healthz": EOF
May 06 20:45:09.650 E ns/openshift-sdn pod/sdn-controller-8vzpr node/ip-10-0-177-94.us-west-1.compute.internal uid/076f2e1a-781a-453a-a522-a085b285e419 container/sdn-controller reason/ContainerExit code/2 cause/Error I0506 19:42:25.616957       1 server.go:27] Starting HTTP metrics server\nI0506 19:42:25.617051       1 leaderelection.go:248] attempting to acquire leader lease openshift-sdn/openshift-network-controller...\nE0506 19:49:03.539063       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: the server was unable to return a response in the time allotted, but may still be processing the request (get configmaps openshift-network-controller)\nE0506 19:49:38.234414       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.254.243:6443: connect: connection refused\nE0506 19:53:06.703763       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.254.243:6443: connect: connection refused\nE0506 19:58:48.993931       1 leaderelection.go:330] error retrieving resource lock openshift-sdn/openshift-network-controller: Get "https://api-int.ci-op-p4r1jp20-0e208.aws-2.ci.openshift.org:6443/api/v1/namespaces/openshift-sdn/configmaps/openshift-network-controller": dial tcp 10.0.254.243:6443: connect: connection refused\n
May 06 20:45:22.972 E ns/openshift-multus pod/multus-admission-controller-78d56bcdfc-flkgb node/ip-10-0-237-187.us-west-1.compute.internal uid/bdf01b7a-81ac-45a9-bfbb-99ce41737ca0 container/multus-admission-controller reason/ContainerExit code/137 cause/Error

Found in 92.59% of runs (250.00% of failures) across 27 total runs and 1 job (37.04% failed).
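Read as plain arithmetic, those percentages are consistent with each other: 92.59% of 27 runs is 25 runs matching this signature, 37.04% of 27 is 10 failed runs, and 25/10 gives the 250.00% figure. In other words, at least 15 of the matching runs ultimately passed, so the signature also appears in successful runs and does not by itself distinguish failures.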