
Conversation

@qinqon
Contributor

@qinqon qinqon commented Oct 7, 2025

This enhancement proposes adding static IP address management (IPAM) capabilities to HyperShift when using the KubeVirt provider and the KubeVirt NodePool is not attached to the default pod network but instead uses a Multus layer-2 network in its place.

The implementation enables operators to define IP pools and network configurations that are automatically allocated to virtual machines during cluster provisioning. This functionality addresses the need for predictable, static network configuration in environments where DHCP is not available or desirable, particularly in on-premises and edge deployments.

https://issues.redhat.com/browse/CNV-67916

@openshift-ci
Contributor

openshift-ci bot commented Oct 7, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign derekwaynecarr for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@qinqon qinqon changed the title hypershift, kubevirt: Add enhancement for IP management CNV-67916: hypershift, kubevirt, add enhancement for IP management Oct 7, 2025
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Oct 7, 2025
@openshift-ci-robot

openshift-ci-robot commented Oct 7, 2025

@qinqon: This pull request references CNV-67916 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the epic to target the "4.21.0" version, but no target version was set.

In response to this:

This enhancement proposes adding static IP address management (IPAM) capabilities to HyperShift when using the KubeVirt provider and the KubeVirt NodePool is not attached to the default pod network but instead uses a Multus layer-2 network in its place.

The implementation enables operators to define IP pools and network configurations that are automatically allocated to virtual machines during cluster provisioning. This functionality addresses the need for predictable, static network configuration in environments where DHCP is not available or desirable, particularly in on-premises and edge deployments.

https://issues.redhat.com/browse/CNV-67916

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch 4 times, most recently from b8d3daf to b6cb5a1 Compare October 9, 2025 09:08
@qinqon qinqon requested a review from phoracek October 9, 2025 09:16
@qinqon
Contributor Author

qinqon commented Oct 14, 2025

/cc @maiqueb

@openshift-ci openshift-ci bot requested a review from maiqueb October 14, 2025 15:26
@qinqon
Contributor Author

qinqon commented Oct 15, 2025

/cc @orenc1

@openshift-ci openshift-ci bot requested a review from orenc1 October 15, 2025 07:15
// Addresses specify IP ranges from which addresses will be allocated for this interface.
// Supports CIDR notation, hyphenated ranges, and single IPs.
// +required
Addresses []string `json:"addresses"`
Contributor

This should probably use hostedcluster_types things like CIDRBlock or ipNet, or maybe define a new one that can be CEL validated if it is really necessary to have ranges and single IPs

Contributor Author

This should probably use hostedcluster_types things like CIDRBlock or ipNet, or maybe define a new one that can be CEL validated if it is really necessary to have ranges and single IPs

Since this can be a CIDR, an IP range, or a single IP, I cannot use those types, so this has to be a string slice. I will add a CEL validation.
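
For illustration, a minimal sketch of what such a CEL rule could look like on the CAPK field. The type name is a placeholder, not the enhancement's final API, and the rule assumes the Kubernetes `isIP`/`isCIDR` CEL functions and the strings extension (`split`) are available to CRD validation in the target version:

```go
package v1beta1

// InterfaceIPPool is an illustrative type; the real field lives in the CAPK API.
type InterfaceIPPool struct {
	// Addresses specify IP ranges from which addresses will be allocated for this interface.
	// Supports CIDR notation, hyphenated ranges, and single IPs, e.g.
	// "192.168.1.0/24", "192.168.1.10-192.168.1.20", or "192.168.1.100".
	// +kubebuilder:validation:MinItems=1
	// +kubebuilder:validation:XValidation:rule="self.all(a, isCIDR(a) || isIP(a) || (a.split('-').size() == 2 && isIP(a.split('-')[0]) && isIP(a.split('-')[1])))",message="each entry must be a CIDR, a hyphenated IP range, or a single IP"
	// +required
	Addresses []string `json:"addresses"`
}
```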


**Management Cluster Components**:
- Cluster API Provider KubeVirt controller (KubevirtMachineTemplate controller): Enhanced with IP allocation logic
- HyperShift operator (NodePool controller): Updated to handle network configuration in NodePool spec
Contributor

Won't it also be necessary to update the ignition generation for the netconfig?

Contributor Author

Won't it also be necessary to update the ignition generation for the netconfig?

The fact that we implement the "IP allocation logic" using virtual machine cloud-init network data is an implementation detail, so it's part of the Cluster API Provider KubeVirt controller changes.
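
As a purely illustrative sketch of that implementation detail: the controller could render the allocated address into an OpenStack-style network_data document and hand it to the VM as configdrive networkData. The struct below only covers the fields quoted later in this thread (`id`, `type`, `link`, `ip_address`) and is not the actual CAPK code:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Minimal subset of the OpenStack network_data.json schema; just enough to
// show where the allocated address lands.
type networkData struct {
	Links    []link    `json:"links"`
	Networks []network `json:"networks"`
}

type link struct {
	ID   string `json:"id"`
	Type string `json:"type"`
}

type network struct {
	ID        string `json:"id"`
	Type      string `json:"type"`
	Link      string `json:"link"`
	IPAddress string `json:"ip_address"`
	Netmask   string `json:"netmask"`
}

// renderNetworkData is a sketch: the controller would call something like this
// with the IP it just allocated from the pool and embed the result in the
// VM's cloud-init (configdrive) networkData.
func renderNetworkData(linkName, allocatedIP, netmask string) ([]byte, error) {
	nd := networkData{
		Links:    []link{{ID: linkName, Type: "phy"}},
		Networks: []network{{ID: "network0", Type: "ipv4", Link: linkName, IPAddress: allocatedIP, Netmask: netmask}},
	}
	return json.MarshalIndent(nd, "", "  ")
}

func main() {
	out, _ := renderNetworkData("eno101", "192.168.1.100", "255.255.255.0")
	fmt.Println(string(out))
}
```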


- Adds complexity to the hypershift kubevirt controllers:
- implementing an IP pool mechanism
- Needs to parse and understand the openshift network config format
Contributor

s/openshift/openstack/ ?

Contributor Author

s/openshift/openstack/ ?

Right. I will also add that it should understand either openstack or netplan, depending on the kind of networkData we want to use: either configdrive or nocloud.

- Symptom: New KubevirtMachine resources remain in Pending state
- Detection: Check KubevirtMachine events and status conditions
- Log output: "No available IPs in pool for interface X"
- Metric: `kubevirt_ippool_exhausted{cluster="X"}`
Contributor

Is this going to be a status condition on the NodePool?

Contributor Author

Is this going to be a status condition on the NodePool?

It should, I will add that
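
A sketch of how that condition could be recorded, using the generic metav1.Condition helpers; HyperShift's NodePool status uses its own condition struct, so the names and types here are placeholders:

```go
package nodepool

import (
	"k8s.io/apimachinery/pkg/api/meta"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// Placeholder for the condition type mentioned later in the enhancement.
const KubevirtIPPoolExhaustedCondition = "KubevirtIPPoolExhausted"

// setIPPoolExhausted records pool exhaustion so the operator can expand the
// pool or add a NodePool with a non-overlapping range. In the real code the
// conditions slice (and its element type) would come from the NodePool status.
func setIPPoolExhausted(conditions *[]metav1.Condition, generation int64, iface string) {
	meta.SetStatusCondition(conditions, metav1.Condition{
		Type:               KubevirtIPPoolExhaustedCondition,
		Status:             metav1.ConditionTrue,
		Reason:             "NoAvailableIPs",
		Message:            "No available IPs in pool for interface " + iface,
		ObservedGeneration: generation,
	})
}
```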

Contributor

@maiqueb maiqueb left a comment

Thank you.


- Enable static IP address assignment for HyperShift guest cluster nodes running on KubeVirt
- Provide a flexible API for defining IP address pools with support for various IP range formats (CIDR, hyphenated ranges, single IPs)
- Support both IPv4 and IPv6 addressing
Contributor

so the kubevirt VMs which are actually the hosted cluster nodes can be ipv6 nodes ?

do we support single stack ipv6 ? or only dual-stack ?

Just trying to figure out the test matrix we need to have later on.

Contributor Author

so the kubevirt VMs which are actually the hosted cluster nodes can be ipv6 nodes ?

do we support single stack ipv6 ? or only dual-stack ?

Just trying to figure out the test matrix we need to have later on.

For HyperShift KubeVirt we don't use the KubeVirt DHCP mechanism; we use OVN DHCPOptions instead. This allows us to support single-stack IPv4/IPv6 and dual stack.

Contributor

but isn't this about localnet secondary interfaces ?

AFAICT we do not have OVN DHCPOpts for localnet secondaries - only on the pod network (masquerade or primary UDN attachments).

Contributor Author

but isn't this about localnet secondary interfaces ?

AFAICT we do not have OVN DHCPOpts for localnet secondaries - only on the pod network (masquerade or primary UDN attachments).

Right, I mixed things up. Since this is static IP assignment and we don't depend on KubeVirt DHCP capabilities, we support single stack (both IPv4 and IPv6) and dual stack.

Contributor

OK, that's aligned with my expectations.

OK, so the test matrix will be complex. Do we need to have anything in origin ?...

Contributor Author

OK, that's aligned with my expectations.

OK, so the test matrix will be complex. Do we need to have anything in origin ?...

At origin we test dual stack

Comment on lines 424 to 425
Migrating nodepools nodes from dynamic to static IPs is not expected, those nodes
will be reconstructed and new ip assigned.
Contributor

I'm sorry, but I don't follow this.

Contributor Author

I'm sorry, but I don't follow this.

I will reword this

Comment on lines +449 to +608
- BootstrapNetworkConfig in KubevirtMachineSpec (cluster-api-provider-kubevirt)
- BootstrapNetworkConfig in nodepool's KubevirtPlatformSpec (hypershift)
Contributor

I'm forced to say one of the most confusing things to me (I'm not HCP savvy) was to differentiate between these two CRDs.

When would I - as a hosted cluster "owner" - use one of these versus the other?

Which of these should I use and why ? Or do I need to use both ?

Contributor Author

I'm forced to say one of the most confusing things to me (I'm not HCP savvy) was to differentiate between these two CRDs.

When would I - as a hosted cluster "owner" - use one of these versus the other?

Which of these should I use and why ? Or do I need to use both ?

At the HyperShift level you use the NodePool CRD; KubevirtMachine is internal to HyperShift.

1. **IP Pool Exhaustion**: Machine creation fails with clear error. Operator must expand pool or delete unused machines.
2. **Invalid Network Configuration**: HostedCluster creation fails validation. Operator must correct configuration.
3. **Network Config Application Failure**: Node fails to boot or is unreachable. Visible in VM console logs. Operator must verify network configuration correctness.
4. **IP Allocation Conflict**: Different nodepools using same network but with overlapping IPs should fail with clear error. Operator should fix the subnets.
Contributor

why should this matter ? The way I see it, this would become an issue if the node pools are being used at the same time in the same node

Contributor Author

why should this matter ? The way I see it, this would become an issue if the node pools are being used at the same time in the same node

NodePools cannot be used for the same node, since they are defining those nodes; the issue is that one NodePool can contribute to the IP exhaustion of the other if they have overlapping subnets.

Contributor

What I don't follow is how one network can have multiple node pools.

Can you point me towards documentation about the node pool? What does it reflect, and how does it relate to the network?

Contributor Author

So the NodePool KubeVirt-specific API has the additionalNetworks field to specify the NAD name. This enhancement is adding the IPAM configuration at the same API level, and a hosted cluster can have multiple NodePools; this means multiple KubeVirt NodePools can have conflicting IPAM configuration, and that's what we want to detect.
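
A rough sketch of the kind of overlap check this implies for NodePools attached to the same network: normalize each pool entry (CIDR, hyphenated range, or single IP) to an inclusive start/end pair and flag any intersection. Function and package names are illustrative, not the actual HyperShift validation code:

```go
package validation

import (
	"fmt"
	"net/netip"
	"strings"
)

// ipRange is an inclusive [start, end] span used to normalize pool entries.
type ipRange struct {
	start, end netip.Addr
}

// parseEntry normalizes one pool entry into an inclusive range.
func parseEntry(entry string) (ipRange, error) {
	if strings.Contains(entry, "/") {
		p, err := netip.ParsePrefix(entry)
		if err != nil {
			return ipRange{}, err
		}
		return ipRange{start: p.Masked().Addr(), end: lastAddr(p)}, nil
	}
	if start, end, ok := strings.Cut(entry, "-"); ok {
		s, errS := netip.ParseAddr(start)
		e, errE := netip.ParseAddr(end)
		if errS != nil || errE != nil {
			return ipRange{}, fmt.Errorf("invalid range %q", entry)
		}
		return ipRange{start: s, end: e}, nil
	}
	a, err := netip.ParseAddr(entry)
	return ipRange{start: a, end: a}, err
}

// lastAddr computes the last address of a prefix by setting all host bits.
func lastAddr(p netip.Prefix) netip.Addr {
	raw := p.Masked().Addr().AsSlice()
	hostBits := len(raw)*8 - p.Bits()
	for i := len(raw) - 1; hostBits > 0 && i >= 0; i-- {
		take := hostBits
		if take > 8 {
			take = 8
		}
		raw[i] |= byte(1<<take - 1)
		hostBits -= take
	}
	out, _ := netip.AddrFromSlice(raw)
	return out
}

// overlaps reports whether any entry of poolA intersects any entry of poolB;
// two NodePools sharing the same network would be rejected when this is true.
func overlaps(poolA, poolB []string) (bool, error) {
	for _, ea := range poolA {
		ra, err := parseEntry(ea)
		if err != nil {
			return false, err
		}
		for _, eb := range poolB {
			rb, err := parseEntry(eb)
			if err != nil {
				return false, err
			}
			if ra.start.Compare(rb.end) <= 0 && rb.start.Compare(ra.end) <= 0 {
				return true, nil
			}
		}
	}
	return false, nil
}
```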

- Resolution: Verify OpenStack network config format correctness, check gateway/DNS reachability

**IP Allocation Errors**:
- Symptom: Machine creation fails or succeeds with incorrect IP
Contributor

is it realistic to include incorrect IP allocation in this metric ?

Asking because (in my own limited mental model) we could prevent this, instead of capturing it in a metric.

Couldn't we ?

Contributor Author

is it realistic to include incorrect IP allocation in this metric ?

Asking because (in my own limited mental model) we could prevent this, instead of capturing it in a metric.

Couldn't we ?

Agree, since we are already using CEL to ensure proper IPs are configured at the addresses.

Contributor Author

is it realistic to include incorrect IP allocation in this metric ?

Asking because (in my own limited mental model) we could prevent this, instead of capturing it in a metric.

Couldn't we ?

Agree, since we are already using CEL to ensure proper IPs are configured at the addresses; I will remove it.

Comment on lines +495 to +649
- New nodes will use DHCP
- No impact on running workloads
Contributor

Not so fast - I'm not so sure this is that easy / that there aren't impacts.

Let's say you're using static IPs, then you stop using it. You remove the NetworkConfig from the template spec.

  1. Your existing nodes with static IPs will continue to function.
  2. A new node will get its IP via DHCP.
  3. What will prevent this new node from getting an IP which is already in use by its static-IP counterpart?

AFAIU, you don't have anything to map the static IPs from the node pool to the DHCP pool availability.

Contributor Author

Not so fast - I'm not so sure this is that easy / that there aren't impacts.

Let's say you're using static IPs, then you stop using it. You remove the NetworkConfig from the template spec.

  1. Your existing nodes with static IPs will continue to function.
  2. A new node will get its IP via DHCP.
  3. What will prevent this new node from getting an IP which is already in use by its static-IP counterpart?

AFAIU, you don't have anything to map the static IPs from the node pool to the DHCP pool availability.

Even though I think we should not support this, it may work if the NodePool is changed so that the bootstrap network config is removed and the virtual machines are restarted.

@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch 2 times, most recently from 908013f to 5db1dd5 Compare October 28, 2025 13:59
@qinqon qinqon requested review from celebdor and maiqueb October 28, 2025 14:01
@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch 5 times, most recently from c7dcbdb to 8287292 Compare November 3, 2025 14:04
@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch from 8287292 to 7be01ce Compare November 12, 2025 08:43
@jparrill jparrill left a comment

Things to have in mind:

  • This implementation will touch the CPO and HO at the HyperShift level, so we need to limit the HyperShift API changes to work only on the newest versions of the CPO.
  • Validations at multiple levels to avoid issues with conflicting IPs.
  • How will config errors fail at the reconciliation level (block, or fail and continue), and at what level: at the HO (for NodePools, where this could short-circuit the reconciliation of other HCs) and at the CPO (for internal issues)?
  • How will config errors be communicated to the MGMT cluster administrator?
  • Dropped some other questions in the review too.

This proposal introduces static IPAM capabilities through modifications to three key components:

1. **Cluster API Provider KubeVirt**: Extend the API to support IP pool definitions and network configuration, implement IP allocation logic from defined pools
2. **HyperShift**: Add network configuration options to the KubeVirt platform specification at nodepool, implement translation from this network configuration to capk ip pool and openstack network config


Is this new API customizable day-0 or day-0 and day-2?

Contributor Author

Is this new API customizable day-0 or day-0 and day-2?

It's going to be immutable; I need to update the enhancement with that. If we suffer from IP exhaustion the customer can create a new NodePool with a bigger subnet.
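
A minimal sketch of how that immutability could be expressed with a CEL transition rule, using a placeholder type name for the NodePool-level IPAM configuration:

```go
package v1beta1

// BootstrapNetworkConfig here is a trimmed-down placeholder, just enough to
// show the immutability rule; the real type lives in KubevirtPlatformSpec.
// +kubebuilder:validation:XValidation:rule="self == oldSelf",message="bootstrapNetworkConfig is immutable; create a new NodePool with a larger range instead"
type BootstrapNetworkConfig struct {
	// Addresses holds the pool entries (CIDRs, ranges, or single IPs).
	// +required
	Addresses []string `json:"addresses"`
}
```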


This proposal introduces static IPAM capabilities through modifications to three key components:

1. **Cluster API Provider KubeVirt**: Extend the API to support IP pool definitions and network configuration, implement IP allocation logic from defined pools


What about the overlapping? Is that allowed? (IIRC this is relevant at machineNetwork level):

  • Between different NPs
  • Between MGMT and HC
  • Between different HCs

Contributor Author

@qinqon qinqon Nov 12, 2025

What about the overlapping? Is that allowed? (IIRC this is relevant at machineNetwork level):

  • Between different NPs

Overlapping is checked per network, so NodePools on the same network are not allowed to overlap, but they are allowed to if they are on different networks.

  • Between MGMT and HC

We don't check that; it's up to the customer, when they decide the IPAM config at the NodePool, not to collide with their infra.

  • Between different HCs

That's up to the customer; in case they want to share a network between HCs they should make sure not to collide. From what we have gathered, that is good enough.


1. **Cluster API Provider KubeVirt**: Extend the API to support IP pool definitions and network configuration, implement IP allocation logic from defined pools
2. **HyperShift**: Add network configuration options to the KubeVirt platform specification at nodepool, implement translation from this network configuration to capk ip pool and openstack network config
3. **CoreOS Afterburn**: Enable parsing and application of OpenStack or netplan network config standard data as dracut network kernel args from cloud-init (config drive or nocloud) for KubeVirt VMs; this is similar to what is done for the [proxmoxve provider](https://github.com/coreos/afterburn/pull/1023).


Could this have issues with conflicting NADs already deployed in the field?

Contributor Author

Could this have issues with conflicting NADs already deployed in the field?

This is going to configure virtual machine networking statically, so any NADs affecting the primary interface will be ignored.

#### Failure Handling

- **IP Pool Exhaustion**: If all IPs in the pool are allocated, machine creation will fail with a clear error message indicating pool exhaustion at the NodePool CRD status with a condition like `KubevirtIPPoolExhausted=true`
- **Invalid Network Configuration**: The API will validate the network configuration format during HostedCluster creation, rejecting invalid configurations; CEL is a perfect fit to implement this.


We need to implement this at 2 levels:

  • CEL first as a api-machinery protection layer
  • NetworkValidations at Hypershift level, in order to avoid conflicts regarding internal rules

"id": "network0",
"type": "ipv4",
"link": "eno101",
"ip_address": "192.168.1.100", <-- this is the part allocated by "cluster api provider kubevirt controller"


IIUC This should be validated in 3 phases:

  • CEL
  • Hypershift Networking validations
  • Openstack provider validation (to avoid conflicting allocations)

Contributor Author

@qinqon qinqon Nov 12, 2025

IIUC This should be validated in 3 phases:

  • CEL
  • Hypershift Networking validations

The IP address is generated by cluster-api-provider-kubevirt, so we don't need to validate it.

  • Openstack provider validation (to avoid conflicting allocations)

This is already merged at fcos and will be part of rhcos

- It will be supported by the ovn-kubernetes team, not only by hypershift kubevirt maintainers.

Cons:
- No clear solution for implementing IPv6 RAs since there is no logical router.


Maybe an extra component like RaDVD deployed only if the Kubevirt cluster is IPv6 based could help. I don't know if Konnectivity handles the RAs properly through the tunnel to guest side.

Contributor Author

Maybe an extra component like RaDVD deployed only if the Kubevirt cluster is IPv6 based could help. I don't know if Konnectivity handles the RAs properly through the tunnel to guest side.

The point is just that there is not a clear solution for that problem; we don't need to find which one.


**Performance Testing**:
- IP allocation performance with large pools (1000+ addresses)
- Scale testing with 100+ node clusters


Do you have any hint about a disaster recovery scenario? (EG: Conflicting IP entries among diff HCs on restoration)

Contributor Author

Do you have any hint about a disaster recovery scenario? (EG: Conflicting IP entries among diff HCs on restoration)

We are going to re-fill the in-memory allocator cache at the CAPI pod's restart, so we ensure that we calculate conflicts correctly.
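
An illustrative sketch of that refill step, with placeholder types since the enhancement does not prescribe the allocator's structure: on startup, list the machines that already exist and mark their addresses as in use before serving new allocations.

```go
package ipam

import "sync"

// allocatedMachine is a placeholder for the per-machine data the controller
// can read back from existing KubevirtMachine objects (or their generated
// cloud-init network data) after a restart.
type allocatedMachine struct {
	Name string
	IPs  []string
}

// Allocator is an illustrative in-memory pool cache.
type Allocator struct {
	mu    sync.Mutex
	inUse map[string]string // ip -> machine name
}

func NewAllocator() *Allocator {
	return &Allocator{inUse: map[string]string{}}
}

// Refill walks machines that already exist and marks their addresses as used,
// so that post-restart allocations cannot hand out a conflicting IP.
func (a *Allocator) Refill(existing []allocatedMachine) {
	a.mu.Lock()
	defer a.mu.Unlock()
	for _, m := range existing {
		for _, ip := range m.IPs {
			a.inUse[ip] = m.Name
		}
	}
}

// Reserved reports whether an IP is already held by some machine.
func (a *Allocator) Reserved(ip string) bool {
	a.mu.Lock()
	defer a.mu.Unlock()
	_, ok := a.inUse[ip]
	return ok
}
```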

@AmrGanz

AmrGanz commented Nov 12, 2025

Having a choice of setting a Static Network Subnet for each nodePool is a great feature to have!

I hope that we also make sure that those subnets don't overlap between multiple nodePools (within one HCP), and I believe that this was taken into consideration already.

@qinqon
Contributor Author

qinqon commented Nov 12, 2025

Having a choice of setting a Static Network Subnet for each nodePool is a great feature to have!

I hope that we also make sure that those subnets don't overlap between multiple nodePools (within one HCP), and I believe that this was taken into consideration already.

@AmrGanz is having the IPAM configuration immutable OK for you? If you suffer from not enough IPs you can always create another NodePool with a new range that does not collide with the previous one.

@AmrGanz

AmrGanz commented Nov 12, 2025

Having a choice of setting a Static Network Subnet for each nodePool is a great feature to have!
I hope that we also make sure that those subnets don't overlap between multiple nodePools (within one HCP), and I believe that this was taken into consideration already.

@AmrGanz is having the IPAM configuration immutable OK for you? If you suffer from not enough IPs you can always create another NodePool with a new range that does not collide with the previous one.

@qinqon If we can allow increasing the range of the already configured subnet that would be great, but this will again require checking the new range against the ones used in other nodePools.
So, if this will require a long time to implement, then for now it would be OK to make it immutable and follow your suggestion if they want to change/increase the already configured subnet.

@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch 7 times, most recently from ed34ce1 to e1c2764 Compare November 13, 2025 10:26
@qinqon qinqon force-pushed the hypershift-kubevirt-ipam branch from e1c2764 to 5acd494 Compare November 13, 2025 11:16
@openshift-ci
Contributor

openshift-ci bot commented Nov 13, 2025

@qinqon: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name: ci/prow/markdownlint
Commit: 5acd494
Required: true
Rerun command: /test markdownlint

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.
