Skip to content

[DOCS-10234] Test Health (new page) #28926

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 6 commits into
base: master
Choose a base branch
from

Conversation

joepeeples
Copy link
Contributor

@joepeeples joepeeples commented Apr 23, 2025

What does this PR do? What is the motivation?

Test Health is a net-new tab within Test Optimization. It helps users (typically platform engineers or devops) track and improve their CI tests.

Merge instructions

Note

This PR intentionally ignores some Vale warnings: several headings use title case because they're matching labeled UI elements, and in one spot future tense actually seems appropriate because it's about a "yet to be identified" use case.

Merge readiness:

  • Ready for merge

For Datadog employees:
Merge queue is enabled in this repo. Your branch name MUST follow the <name>/<description> convention and include the forward slash (/). Without this format, your pull request will not pass in CI, the GitLab pipeline will not run, and you won't get a branch preview. Getting a branch preview makes it easier for us to check any issues with your PR, such as broken links.

If your branch doesn't follow this format, rename it or create a new branch and PR.

To have your PR automatically merged after it receives the required reviews, add the following PR comment:

/merge

Additional notes

@github-actions github-actions bot added the Architecture Everything related to the Doc backend label Apr 23, 2025
Copy link
Contributor

Preview links (active after the build_preview check completes)

New or renamed files

@joepeeples joepeeples requested a review from KickingTheTV April 23, 2025 18:00
@github-actions github-actions bot added the Images Images are added/removed with this PR label Apr 23, 2025
Copy link
Contributor

@KickingTheTV KickingTheTV left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@joepeeples joepeeples marked this pull request as ready for review May 5, 2025 19:50
@joepeeples joepeeples requested review from a team as code owners May 5, 2025 19:50
@drichards-87 drichards-87 added the editorial review Waiting on a more in-depth review label May 5, 2025
@drichards-87
Copy link
Contributor

Created DOCS-10793 for Docs Team review.

Copy link
Contributor

@jhgilbert jhgilbert left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved with minor comments, thank you!

Based on the current time frame and filters applied, the dashboard highlights the following key metrics:

- [**Pipelines Failed**](#pipelines-failed): Sum total of pipelines that failed due to flaky tests
- [**Time Wasted in CI**](#time-wasted-in-ci): Total time wasted in CI due to flaky tests
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- [**Time Wasted in CI**](#time-wasted-in-ci): Total time wasted in CI due to flaky tests
- [**Time Wasted in CI**](#time-wasted-in-ci): Total time spent in CI due to flaky tests

Just to avoid echoing the term in the definition

|--------|-------------|
| **Pipeline Executions with Tests** | Number of pipeline executions with one or more test sessions. |
| **Failures Due to Flaky Tests** | Number of pipeline executions that failed solely due to flaky tests. All tests that failed have one or more of the following tags: `@test.is_known_flaky` or `@test.is_new_flaky`. |
| **Failures Due to Non-Flaky Tests** | Number of pipeline executions that failed due to tests without any flakiness. All tests that failed do not have any of the following tags: `@test.is_known_flaky`, `@test.is_new_flaky`, and `@test.is_flaky`. |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
| **Failures Due to Non-Flaky Tests** | Number of pipeline executions that failed due to tests without any flakiness. All tests that failed do not have any of the following tags: `@test.is_known_flaky`, `@test.is_new_flaky`, and `@test.is_flaky`. |
| **Failures Due to Non-Flaky Tests** | Number of pipeline executions that failed due to tests without any flakiness. None of the failing tests have any of the following tags: `@test.is_known_flaky`, `@test.is_new_flaky`, and `@test.is_flaky`. |

### Enhance developer experience
Use **Dev Experience - Test Failure Breakdown** and **Dev Experience - Time Lost Breakdown** to identify how often flaky tests in particular cause failures and waste CI time.

These Test Optimization features improve developer experience by alleviating test failures and wasted time:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
These Test Optimization features improve developer experience by alleviating test failures and wasted time:
These Test Optimization features improve developer experience by reducing test failures and wasted time:

Just to use a less fancy word :)

- **[Test Impact Analysis][4]** minimizes the flaky tests that run, by only running relevant tests based on code coverage. Skipping irrelevant tests also shortens the feedback loop for developers.

### Maximize pipeline efficiency and reduce costs
Lengthy test suites slow down feedback loops to developers and running irrelevant tests incurs unnecessary costs.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Lengthy test suites slow down feedback loops to developers and running irrelevant tests incurs unnecessary costs.
Lengthy test suites slow down feedback loops to developers, and running irrelevant tests incurs unnecessary costs.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's not a big enough deal to fix, but this screenshot is on the small side in terms of readability. Next time, you might tap "Ctrl +" a time or two in order to cut down on the amount of white space in the shot, making the text bigger and easier to read without the image needing to be any larger.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Architecture Everything related to the Doc backend editorial review Waiting on a more in-depth review Images Images are added/removed with this PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants