ci(e2e): classify staging e2e results into a structured digest by jacekradko · Pull Request #8760 · clerk/javascript

jacekradko · 2026-06-05T03:58:37Z

The report job posted a bare red circle on any gating failure with no detail about what actually broke. This adds a classifier that turns each run into a structured digest.

It parses every leg's Playwright JSON report (uploaded since #8756) and buckets each failed test by error signature: FAPI 429 and handshake/JWT clock-skew are infra-flake; everything else is a candidate regression; and passed-on-retry tests are reported as flaky. Unknown signatures default to candidate regression, never the reverse, so a real break can't hide as infra. The report job writes the digest to the job summary on every run (so the informational generic leg's breakdown is always visible) and posts that same digest to Slack on a gating-leg failure. The digest looks like:

:red_circle: Staging E2E: gating failure
_ref main · sdk latest · clerk_go abc1234_
• ❌ smoke: 1 candidate regression
• ❌ generic (informational): 2 candidate regressions, 14 infra-flake, 5 flaky

The classifier is a standalone Node script with unit tests, and it never fails the build (an empty or unreadable reports directory just yields an "all green" digest). The Slack trigger is unchanged (needs.integration-tests.result == 'failure', i.e. a gating-leg failure), so this is a strictly richer message, not a noisier one.

Two pieces are deliberately deferred: persisting per-test history to distinguish new from sustained failures (which would let the informational generic leg's regressions page a human rather than just appear in the summary), and wiring the clerk_go commit status to the smoke gate, which the plan says to hold until the smoke leg is demonstrably green. Stacked on #8759.

changeset-bot · 2026-06-05T03:58:42Z

🦋 Changeset detected

Latest commit: 1510b55

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 0 packages

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

vercel · 2026-06-05T03:58:43Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
clerk-js-sandbox	Ready	Preview, Comment	Jun 5, 2026 11:44am

coderabbitai · 2026-06-05T03:58:43Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository YAML (base), Repository UI (inherited)

Review profile: CHILL

Plan: Pro

Run ID: 668abfe9-f9b9-44ee-b057-bb050ffbdfc1

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

The report job posted a bare red circle on any gating failure with no detail. Add a classifier that parses every leg's Playwright JSON report and buckets each failed test by error signature: FAPI 429 and handshake/JWT clock-skew are infra-flake, everything else (unknown signatures included, never the reverse) is a candidate regression, and passed-on-retry tests are reported as flaky. The report job now downloads the per-leg reports, writes the classified digest to the job summary on every run (so the informational generic leg's breakdown is always visible), and posts that same digest to Slack on a gating-leg failure instead of the old opaque message. Reporting never fails the build. Deferred: persisting per-test history to alert on new-vs-sustained failures (which would let the informational generic leg's regressions page a human), and wiring the clerk_go commit status to the smoke gate.

github-actions Bot added the actions label Jun 5, 2026

vercel Bot deployed to Preview June 5, 2026 03:59 View deployment

jacekradko force-pushed the jacek/staging-e2e-smoke-leg branch from 3d2c6ea to db5acc0 Compare June 5, 2026 11:42

jacekradko force-pushed the jacek/staging-e2e-reporting branch from ac9a8f8 to 1510b55 Compare June 5, 2026 11:43

vercel Bot deployed to Preview June 5, 2026 11:44 View deployment

jacekradko mentioned this pull request Jun 5, 2026

ci(e2e): upload staging Playwright JSON report from integration/ #8766

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci(e2e): classify staging e2e results into a structured digest#8760

ci(e2e): classify staging e2e results into a structured digest#8760
jacekradko wants to merge 1 commit into
jacek/staging-e2e-smoke-legfrom
jacek/staging-e2e-reporting

jacekradko commented Jun 5, 2026

Uh oh!

changeset-bot Bot commented Jun 5, 2026 •

edited

Loading

Uh oh!

vercel Bot commented Jun 5, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot commented Jun 5, 2026 •

edited

Loading

Review skipped

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jacekradko commented Jun 5, 2026

Uh oh!

changeset-bot Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

vercel Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

changeset-bot Bot commented Jun 5, 2026 •

edited

Loading

vercel Bot commented Jun 5, 2026 •

edited

Loading

coderabbitai Bot commented Jun 5, 2026 •

edited

Loading