Add custom informer for ManagedClusterAddOns #73

fxiang1 · 2025-10-16T14:53:08Z

https://issues.redhat.com/browse/ACM-24726

Add custom informer for ManagedClusterAddOns to reduce memory usage in large environments (2500 clusters)
Increase cache resync period to 10 mins

Signed-off-by: fxiang1 <[email protected]>

codecov · 2025-10-16T14:55:17Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 59.84%. Comparing base (3267edb) to head (0e04278).
⚠️ Report is 7 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #73      +/-   ##
==========================================
- Coverage   65.86%   59.84%   -6.02%     
==========================================
  Files           2        3       +1     
  Lines         706      924     +218     
==========================================
+ Hits          465      553      +88     
- Misses        218      340     +122     
- Partials       23       31       +8

Flag	Coverage Δ
unit	`59.84% <ø> (-6.02%)`	⬇️


Signed-off-by: fxiang1 <[email protected]>

mikeshng · 2025-10-16T15:08:42Z

controllers/clusterpermission_controller.go

+	// Start the custom informer in a goroutine
+	go func() {
+		if err := customInformer.Start(); err != nil {
+			log.Log.Error(err, "Failed to start custom ManagedClusterAddOn informer")


We might need to exit or panic here; otherwise ManagedClusterAddOns won't be watched, and that will be hard to detect afterwards.
Does it make sense to retry a few times and then exit?

Thanks Mike! Yes, the AI suggested retrying.

Yes, it makes sense to add retry logic here! The informer's Start() method can fail if the cache doesn't sync, which could happen due to:

1. Temporary API server unavailability during startup
2. Network issues
3. The ManagedClusterAddOn CRD not being installed yet
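
For reference, a minimal sketch of the "retry a few times, then exit" idea from this thread. It is not the code merged in this PR: `startWithRetry`, `errCacheNotSynced`, and the flaky start function are hypothetical stand-ins, and real code would pass something like `customInformer.Start` instead.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"time"
)

// errCacheNotSynced is a hypothetical error standing in for a failed cache sync.
var errCacheNotSynced = errors.New("cache did not sync")

// startWithRetry calls start up to maxAttempts times, doubling the wait
// between attempts. It returns nil on the first success, or the last
// error wrapped with the attempt count.
func startWithRetry(maxAttempts int, initialDelay time.Duration, start func() error) error {
	if maxAttempts < 1 {
		return errors.New("maxAttempts must be at least 1")
	}
	delay := initialDelay
	var lastErr error
	for attempt := 1; attempt <= maxAttempts; attempt++ {
		if lastErr = start(); lastErr == nil {
			return nil
		}
		if attempt < maxAttempts {
			time.Sleep(delay)
			delay *= 2
		}
	}
	return fmt.Errorf("informer failed to start after %d attempts: %w", maxAttempts, lastErr)
}

func main() {
	calls := 0
	// Stand-in for the informer's Start(): fails twice, then succeeds.
	flakyStart := func() error {
		calls++
		if calls < 3 {
			return errCacheNotSynced
		}
		return nil
	}
	if err := startWithRetry(5, 10*time.Millisecond, flakyStart); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1) // exit loudly rather than run without the watch
	}
	fmt.Println("informer started after", calls, "attempts")
}
```

Exiting after the final attempt makes the failure visible, for example as a restarting pod, rather than leaving the controller running without the watch.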

mikeshng · 2025-10-16T15:11:10Z

controllers/clusterpermission_controller.go

+	// Call the reconcile function directly
+	_, err := r.Reconcile(ctx, req)
+	if err != nil {
+		log.Error(err, "Failed to reconcile ClusterPermission", "name", cp.Name, "namespace", cp.Namespace)


When reconcile returns an error, the general pattern is to requeue. I don't see that here or on the caller side.

The AI said that since we are calling Reconcile manually from the custom informer event handler (not through the normal controller queue), we can only retry here. So the AI added retries for this as well 😅
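
For comparison, a rough sketch of the requeue pattern the review comment refers to, using only the standard library. This is an illustration and not what the PR does (the PR retries in place): `processWithRequeue`, `errTransient`, and the keys are hypothetical. It also omits things a real controller work queue typically adds, such as rate-limited backoff and key de-duplication.

```go
package main

import (
	"errors"
	"fmt"
)

// errTransient is a hypothetical error standing in for a failed reconcile.
var errTransient = errors.New("transient reconcile error")

// processWithRequeue drains a simple FIFO queue of keys, calling reconcile
// on each one. A key whose reconcile fails is appended back to the queue
// until it has been tried maxAttempts times. It returns the keys that
// never succeeded.
func processWithRequeue(keys []string, maxAttempts int, reconcile func(key string) error) []string {
	queue := append([]string(nil), keys...) // copy so the caller's slice is untouched
	attempts := make(map[string]int)
	var dropped []string
	for len(queue) > 0 {
		key := queue[0]
		queue = queue[1:]
		attempts[key]++
		if err := reconcile(key); err != nil {
			if attempts[key] < maxAttempts {
				queue = append(queue, key) // requeue for another attempt
			} else {
				dropped = append(dropped, key) // give up on this key
			}
		}
	}
	return dropped
}

func main() {
	// Number of times each key's reconcile fails before succeeding.
	failuresLeft := map[string]int{"cluster1/perm-a": 1, "cluster2/perm-b": 5}
	reconcile := func(key string) error {
		if failuresLeft[key] > 0 {
			failuresLeft[key]--
			return errTransient
		}
		return nil
	}
	dropped := processWithRequeue([]string{"cluster1/perm-a", "cluster2/perm-b"}, 3, reconcile)
	fmt.Println("gave up on:", dropped)
}
```

With a queue, a failing key does not block the event handler while it waits to be retried; an in-place retry is simpler but holds the handler until the retries finish.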

Signed-off-by: fxiang1 <[email protected]>

…r-management-io#73)

* Red Hat Konflux update cluster-permission-acm-214
  Signed-off-by: red-hat-konflux <[email protected]>
* Bump Go to 1.23
  Signed-off-by: fxiang1 <[email protected]>
* Add microdnf update -y
  Signed-off-by: fxiang1 <[email protected]>

---------

Signed-off-by: fxiang1 <[email protected]>
Co-authored-by: red-hat-konflux <[email protected]>
Co-authored-by: fxiang1 <[email protected]>

mikeshng

/approve

/lgtm

openshift-ci · 2025-10-16T19:00:31Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: fxiang1, mikeshng

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [fxiang1,mikeshng]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

fxiang1 · 2025-10-16T19:11:08Z

Merging manually, as Codecov does not take the e2e test coverage into account.

Add custom informer for ManagedClusterAddOns

b6b7e9e

Signed-off-by: fxiang1 <[email protected]>

openshift-ci bot requested review from mikeshng and xiangjingli October 16, 2025 14:53

openshift-ci bot added the approved label Oct 16, 2025

Add error handling

d95aa87

Signed-off-by: fxiang1 <[email protected]>

mikeshng reviewed Oct 16, 2025


fxiang1 added 3 commits October 16, 2025 11:50

Improve error handling

77d8a6c

Signed-off-by: fxiang1 <[email protected]>

Handle error

cc57a26

Signed-off-by: fxiang1 <[email protected]>

Handle event handler error

0e04278

Signed-off-by: fxiang1 <[email protected]>

mikeshng approved these changes Oct 16, 2025


openshift-ci bot assigned mikeshng Oct 16, 2025

openshift-ci bot added the lgtm label Oct 16, 2025

fxiang1 merged commit 49671db into open-cluster-management-io:main Oct 16, 2025
6 of 8 checks passed

fxiang1 deleted the feng-informer branch October 16, 2025 19:11
