WIP: Trying hive. Take #3 ;-) #1331

rhopp · 2025-10-07T08:22:23Z

No description provided.

openshift-ci · 2025-10-07T08:22:35Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

openshift-ci · 2025-10-07T08:22:38Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rhopp

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~integration-tests/OWNERS~~ [rhopp]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coderabbitai · 2025-10-07T08:22:45Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

rhopp · 2025-10-07T13:08:50Z

/retest

abraverm · 2025-10-07T14:20:13Z

@rhopp continue to lurk on your great work 👍

rhopp · 2025-10-09T11:37:48Z

/retest

rhopp · 2025-10-10T04:52:25Z

/retest

rhopp · 2025-10-22T17:20:11Z

/retest

rhopp · 2025-10-23T07:11:38Z

/retest

rhopp · 2025-11-04T06:42:47Z

/retest

rhopp · 2025-11-07T12:24:59Z

/retest

rhopp · 2025-11-11T11:27:36Z

/retest

Add a 5-minute retry loop (30 attempts with 10-second intervals) to ensure successful login to the provisioned cluster using kubeadmin credentials. This handles cases where the cluster API is accessible but authentication may not be immediately ready. The retry loop includes proper validation via 'oc whoami' and integrates with the existing provisioning retry logic. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

The previous implementation had a race condition where 'oc whoami' would succeed immediately after login but fail moments later when called again. This caused intermittent authentication failures even though login was reported as successful. Changes: - Add 2-second wait after successful login to allow auth to propagate - Capture 'oc whoami' output once instead of calling it multiple times - Add additional verification step with 'oc version' to ensure cluster commands work - Improve error logging to show exit codes and output for debugging This should resolve the "Unauthorized" errors that occurred right after successful login (as seen in lines 399-405 of the previous run logs). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

The --short flag is not supported by the oc version command (unlike kubectl). Using 'oc get namespaces' instead provides better verification because: - It actually requires authentication and cluster access to succeed - oc version can show client version even without being logged in - This ensures we're truly authenticated and can access cluster resources 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Signed-off-by: Radim Hopp <[email protected]>

Add comprehensive stability monitoring to diagnose intermittent authorization failures that occur after successful cluster provisioning. This will help identify if the cluster becomes unstable over time or if there are specific patterns to the failures. The observation loop runs for 10 minutes (120 iterations at 5-second intervals) and tests three critical components: 1. Cluster Operators (oc get co) - validates cluster operator availability 2. Console URL accessibility - ensures the web console remains reachable 3. API Server (oc get namespaces) - verifies authentication and API access For each test, the script tracks: - Success/failure counts - Pattern string showing timeline (e.g., "SSSSSFFFSSSS" where S=success, F=failure) - Timestamped logs for any failures - Progress updates every ~100 seconds This diagnostic data will help determine: - If failures are sporadic or follow a pattern - Which component(s) are unstable - How long it takes for the cluster to stabilize - Whether the issue is authentication-specific or broader 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Add a new task to collect cluster artifacts when the pipeline fails. This task: - Runs in the finally section to execute even when other tasks fail - Only executes when pipeline status is not "Succeeded" - Logs into the provisioned cluster using the ocp-login-command - Runs gather-extra.sh script to collect diagnostic information - Pushes collected artifacts to OCI storage for later analysis The collected artifacts will help diagnose issues that occur during test execution, particularly the intermittent authorization failures. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

openshift-merge-robot · 2025-11-13T13:57:40Z

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

sonarqubecloud · 2025-11-14T13:41:15Z

Quality Gate passed

Issues
2 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

rhopp · 2025-11-18T09:22:08Z

/retest

konflux-ci-qe-bot · 2025-11-18T10:22:46Z

@rhopp: The following test has Failed, say /retest to rerun failed tests.

PipelineRun Name	Status	Rerun command	Build Log	Test Log
`e2e-4.19-dp2jk`	Failed	`/retest`	View Pipeline Log	View Test Logs

Inspecting Test Artifacts

To inspect your test artifacts, follow these steps:

Install ORAS (see the ORAS installation guide).
Download artifacts with the following commands:

mkdir -p oras-artifacts
cd oras-artifacts
oras pull quay.io/konflux-test-storage/rhtap-team/rhtap-cli:e2e-4.19-dp2jk

Test results analysis

OCI Artifact Browser URL

openshift-ci bot added the do-not-merge/work-in-progress label Oct 7, 2025

openshift-ci bot added the approved label Oct 7, 2025

rhopp mentioned this pull request Oct 7, 2025

WIP: Switch to hive from rosa #738

Closed

rhopp force-pushed the hive-try3 branch 2 times, most recently from 65c5e42 to 0f4afc8 Compare October 9, 2025 14:17

rhopp force-pushed the hive-try3 branch from 0f4afc8 to 4dfb0c1 Compare October 10, 2025 11:44

rhopp force-pushed the hive-try3 branch from 183d5e8 to 8349ffe Compare October 22, 2025 15:23

rhopp force-pushed the hive-try3 branch 3 times, most recently from 8a41531 to 8b6a645 Compare October 31, 2025 14:06

rhopp force-pushed the hive-try3 branch 2 times, most recently from 260dc09 to d07c279 Compare November 6, 2025 16:27

rhopp force-pushed the hive-try3 branch 2 times, most recently from a36fdcf to 70ad5ed Compare November 11, 2025 09:51

rhopp force-pushed the hive-try3 branch from 70ad5ed to b0c21a8 Compare November 13, 2025 08:51

rhopp added 3 commits November 13, 2025 12:25

WIP: Trying hive. Take #3 ;-)

001d329

fix result reference

d3144aa

increase timeout

0e98ff1

rhopp and others added 13 commits November 13, 2025 12:25

run just single pipeline

01b6d55

switch clusterpoo

f59d667

Increase some timeout ;-)

3113b13

Disable tls in tests

ed45ab6

Update tssc-test-image in e2e tests to use a specific self-signed image

a4b3bf0

try the convalescence logic multiple times

f3f38d8

Add CUSTOM_ROOT_CA var to gitops repo

baa6f2d

Update test image

e8dd330

Signed-off-by: Radim Hopp <[email protected]>

New version of tesplan

1def4a4

Signed-off-by: Radim Hopp <[email protected]>

rhopp force-pushed the hive-try3 branch from b0c21a8 to a44f93f Compare November 13, 2025 11:25

openshift-merge-robot added the needs-rebase label Nov 13, 2025

rhopp added 2 commits November 14, 2025 13:07

Add cert to proxy/cluster

20acde3

forgot to call the function :-(

6e88237

WIP: Trying hive. Take #3 ;-) #1331

Are you sure you want to change the base?

WIP: Trying hive. Take #3 ;-) #1331

Uh oh!

Conversation

rhopp commented Oct 7, 2025

Uh oh!

openshift-ci bot commented Oct 7, 2025

Uh oh!

openshift-ci bot commented Oct 7, 2025

Uh oh!

coderabbitai bot commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

rhopp commented Oct 7, 2025

Uh oh!

abraverm commented Oct 7, 2025

Uh oh!

rhopp commented Oct 9, 2025

Uh oh!

rhopp commented Oct 10, 2025

Uh oh!

rhopp commented Oct 22, 2025

Uh oh!

rhopp commented Oct 23, 2025

Uh oh!

rhopp commented Nov 4, 2025

Uh oh!

rhopp commented Nov 7, 2025

Uh oh!

rhopp commented Nov 11, 2025

Uh oh!

openshift-merge-robot commented Nov 13, 2025

Uh oh!

sonarqubecloud bot commented Nov 14, 2025

Quality Gate passed

Uh oh!

rhopp commented Nov 18, 2025

Uh oh!

konflux-ci-qe-bot commented Nov 18, 2025

Inspecting Test Artifacts

Test results analysis

OCI Artifact Browser URL

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

coderabbitai bot commented Oct 7, 2025 •

edited

Loading