Skip to content

[WIP] DRA e2e test to validate Nvidia gpu #29842

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

tkashem
Copy link
Contributor

@tkashem tkashem commented May 23, 2025

No description provided.

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 23, 2025
@openshift-ci openshift-ci bot requested review from flavianmissi and p0lyn0mial May 23, 2025 20:08
@openshift-ci openshift-ci bot added the vendor-update Touching vendor dir or related files label May 23, 2025
Copy link
Contributor

openshift-ci bot commented May 23, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: tkashem
Once this PR has been reviewed and has the lgtm label, please assign neisw for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Copy link

openshift-trt bot commented May 24, 2025

Job Failure Risk Analysis for sha: 65c8551

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-agnostic-ovn-cmd IncompleteTests
Tests for this run (19) are below the historical average (1657): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws IncompleteTests
Tests for this run (18) are below the historical average (3039): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-csi IncompleteTests
Tests for this run (17) are below the historical average (1722): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (17) are below the historical average (1328): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn IncompleteTests
Tests for this run (18) are below the historical average (2895): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-cgroupsv2 IncompleteTests
Tests for this run (18) are below the historical average (2887): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-edge-zones IncompleteTests
Tests for this run (20) are below the historical average (3101): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-etcd-scaling IncompleteTests
Tests for this run (18) are below the historical average (1760): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-fips IncompleteTests
Tests for this run (17) are below the historical average (3123): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-image-registry IncompleteTests
Tests for this run (18) are below the historical average (1557): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-kube-apiserver-rollout IncompleteTests
Tests for this run (17) are below the historical average (1600): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift IncompleteTests
Tests for this run (16) are below the historical average (1600): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift-serial IncompleteTests
Tests for this run (16) are below the historical average (803): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-1of2 IncompleteTests
Tests for this run (18) are below the historical average (2005): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 IncompleteTests
Tests for this run (18) are below the historical average (1975): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-publicnet-1of2 IncompleteTests
Tests for this run (18) are below the historical average (2029): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-publicnet-2of2 IncompleteTests
Tests for this run (18) are below the historical average (1947): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node IncompleteTests
Tests for this run (17) are below the historical average (2811): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial IncompleteTests
Tests for this run (18) are below the historical average (1659): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-upgrade IncompleteTests
Tests for this run (19) are below the historical average (3759): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Showing 20 of 50 jobs analysis

Copy link
Contributor

openshift-ci bot commented May 27, 2025

@tkashem: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/okd-e2e-gcp 1a35f9b link false /test okd-e2e-gcp
ci/prow/e2e-vsphere-ovn 1a35f9b link true /test e2e-vsphere-ovn
ci/prow/e2e-openstack-serial 1a35f9b link false /test e2e-openstack-serial
ci/prow/e2e-aws-ovn-serial-1of2 1a35f9b link true /test e2e-aws-ovn-serial-1of2
ci/prow/e2e-aws-ovn-upgrade 1a35f9b link false /test e2e-aws-ovn-upgrade
ci/prow/e2e-metal-ipi-ovn-ipv6 1a35f9b link true /test e2e-metal-ipi-ovn-ipv6
ci/prow/e2e-aws-ovn-microshift 1a35f9b link true /test e2e-aws-ovn-microshift
ci/prow/e2e-gcp-fips-serial-1of2 1a35f9b link false /test e2e-gcp-fips-serial-1of2
ci/prow/e2e-aws-proxy 1a35f9b link false /test e2e-aws-proxy
ci/prow/e2e-aws-ovn 1a35f9b link false /test e2e-aws-ovn
ci/prow/verify-deps 1a35f9b link true /test verify-deps
ci/prow/e2e-aws-ovn-cgroupsv2 1a35f9b link false /test e2e-aws-ovn-cgroupsv2
ci/prow/4.12-upgrade-from-stable-4.11-e2e-aws-ovn-upgrade-rollback 1a35f9b link false /test 4.12-upgrade-from-stable-4.11-e2e-aws-ovn-upgrade-rollback
ci/prow/e2e-gcp-disruptive 1a35f9b link false /test e2e-gcp-disruptive
ci/prow/e2e-aws-ovn-microshift-serial 1a35f9b link true /test e2e-aws-ovn-microshift-serial
ci/prow/e2e-metal-ipi-virtualmedia 1a35f9b link false /test e2e-metal-ipi-virtualmedia
ci/prow/e2e-openstack-ovn 1a35f9b link false /test e2e-openstack-ovn
ci/prow/lint 1a35f9b link true /test lint
ci/prow/e2e-metal-ipi-ovn-kube-apiserver-rollout 1a35f9b link false /test e2e-metal-ipi-ovn-kube-apiserver-rollout
ci/prow/e2e-metal-ipi-ovn 1a35f9b link false /test e2e-metal-ipi-ovn
ci/prow/e2e-vsphere-ovn-etcd-scaling 1a35f9b link false /test e2e-vsphere-ovn-etcd-scaling
ci/prow/e2e-gcp-fips-serial-2of2 1a35f9b link false /test e2e-gcp-fips-serial-2of2
ci/prow/e2e-metal-ipi-ovn-dualstack-local-gateway 1a35f9b link false /test e2e-metal-ipi-ovn-dualstack-local-gateway
ci/prow/e2e-agnostic-ovn-cmd 1a35f9b link false /test e2e-agnostic-ovn-cmd
ci/prow/verify 1a35f9b link true /test verify
ci/prow/e2e-gcp-ovn 1a35f9b link true /test e2e-gcp-ovn
ci/prow/e2e-gcp-ovn-rt-upgrade 1a35f9b link false /test e2e-gcp-ovn-rt-upgrade
ci/prow/e2e-aws-ovn-edge-zones 1a35f9b link true /test e2e-aws-ovn-edge-zones
ci/prow/e2e-aws-ovn-single-node-upgrade 1a35f9b link false /test e2e-aws-ovn-single-node-upgrade
ci/prow/images 1a35f9b link true /test images
ci/prow/e2e-azure 1a35f9b link false /test e2e-azure
ci/prow/e2e-aws-disruptive 1a35f9b link false /test e2e-aws-disruptive
ci/prow/okd-scos-images 1a35f9b link true /test okd-scos-images
ci/prow/e2e-azure-ovn-upgrade 1a35f9b link false /test e2e-azure-ovn-upgrade
ci/prow/e2e-azure-ovn-etcd-scaling 1a35f9b link false /test e2e-azure-ovn-etcd-scaling
ci/prow/e2e-aws 1a35f9b link false /test e2e-aws
ci/prow/e2e-aws-ovn-etcd-scaling 1a35f9b link false /test e2e-aws-ovn-etcd-scaling
ci/prow/e2e-metal-ipi-serial-ovn-ipv6-1of2 1a35f9b link false /test e2e-metal-ipi-serial-ovn-ipv6-1of2
ci/prow/e2e-vsphere-ovn-dualstack-primaryv6 1a35f9b link false /test e2e-vsphere-ovn-dualstack-primaryv6
ci/prow/e2e-aws-ovn-fips 1a35f9b link true /test e2e-aws-ovn-fips
ci/prow/e2e-hypershift-conformance 1a35f9b link false /test e2e-hypershift-conformance
ci/prow/e2e-gcp-ovn-etcd-scaling 1a35f9b link false /test e2e-gcp-ovn-etcd-scaling
ci/prow/unit 1a35f9b link true /test unit
ci/prow/e2e-metal-ipi-serial-ovn-ipv6-2of2 1a35f9b link false /test e2e-metal-ipi-serial-ovn-ipv6-2of2
ci/prow/e2e-metal-ipi-ovn-dualstack 1a35f9b link false /test e2e-metal-ipi-ovn-dualstack
ci/prow/e2e-aws-ovn-single-node-serial 1a35f9b link false /test e2e-aws-ovn-single-node-serial
ci/prow/e2e-metal-ipi-serial-1of2 1a35f9b link false /test e2e-metal-ipi-serial-1of2
ci/prow/e2e-aws-ovn-image-registry 1a35f9b link true /test e2e-aws-ovn-image-registry
ci/prow/e2e-aws-csi 1a35f9b link false /test e2e-aws-csi
ci/prow/e2e-aws-ovn-kube-apiserver-rollout 1a35f9b link false /test e2e-aws-ovn-kube-apiserver-rollout
ci/prow/e2e-aws-ovn-single-node 1a35f9b link false /test e2e-aws-ovn-single-node
ci/prow/e2e-gcp-csi 1a35f9b link false /test e2e-gcp-csi
ci/prow/e2e-aws-ovn-serial-publicnet-2of2 1a35f9b link false /test e2e-aws-ovn-serial-publicnet-2of2
ci/prow/e2e-aws-ovn-serial-publicnet-1of2 1a35f9b link false /test e2e-aws-ovn-serial-publicnet-1of2
ci/prow/e2e-metal-ipi-serial-2of2 1a35f9b link false /test e2e-metal-ipi-serial-2of2
ci/prow/e2e-vsphere-ovn-upi 1a35f9b link true /test e2e-vsphere-ovn-upi
ci/prow/e2e-aws-ovn-serial-2of2 1a35f9b link true /test e2e-aws-ovn-serial-2of2
ci/prow/okd-scos-e2e-aws-ovn 1a35f9b link false /test okd-scos-e2e-aws-ovn
ci/prow/e2e-gcp-ovn-upgrade 1a35f9b link true /test e2e-gcp-ovn-upgrade

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Copy link

openshift-trt bot commented May 27, 2025

Job Failure Risk Analysis for sha: 1a35f9b

Job Name Failure Risk
pull-ci-openshift-origin-main-4.12-upgrade-from-stable-4.11-e2e-aws-ovn-upgrade-rollback IncompleteTests
Tests for this run (93) are below the historical average (255): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-agnostic-ovn-cmd IncompleteTests
Tests for this run (18) are below the historical average (1645): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws IncompleteTests
Tests for this run (18) are below the historical average (3038): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-csi IncompleteTests
Tests for this run (17) are below the historical average (1699): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (18) are below the historical average (1358): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn IncompleteTests
Tests for this run (17) are below the historical average (2930): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-cgroupsv2 IncompleteTests
Tests for this run (18) are below the historical average (2928): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-edge-zones IncompleteTests
Tests for this run (20) are below the historical average (3043): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-etcd-scaling IncompleteTests
Tests for this run (17) are below the historical average (1743): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-fips IncompleteTests
Tests for this run (18) are below the historical average (3126): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-image-registry IncompleteTests
Tests for this run (18) are below the historical average (1580): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-kube-apiserver-rollout IncompleteTests
Tests for this run (18) are below the historical average (1641): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift IncompleteTests
Tests for this run (15) are below the historical average (1607): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift-serial IncompleteTests
Tests for this run (15) are below the historical average (801): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-1of2 IncompleteTests
Tests for this run (18) are below the historical average (1979): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 IncompleteTests
Tests for this run (17) are below the historical average (1950): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-publicnet-1of2 IncompleteTests
Tests for this run (18) are below the historical average (1944): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-publicnet-2of2 IncompleteTests
Tests for this run (18) are below the historical average (1867): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node IncompleteTests
Tests for this run (17) are below the historical average (2764): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial IncompleteTests
Tests for this run (18) are below the historical average (1552): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Showing 20 of 51 jobs analysis

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. vendor-update Touching vendor dir or related files
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant