Proposal: add UnhealthyNodeNames field in HyperNode status and common HyperNodeConditionType #155
base: network-topology-dev
Signed-off-by: fishingfly <[email protected]>
Hi @Monokaix
…tring Signed-off-by: fishingfly <[email protected]>
… detail Signed-off-by: fishingfly <[email protected]>
/lgtm
What is the problem you're trying to solve
- There are `Spec` and `Status` fields on the HyperNode. Currently, the `Status` field only contains `Conditions` and `NodeCount`.
- … `Conditions` field.
- The `Conditions` field reflects the overall status of the HyperNode, which may be connected to multiple nodes. However, there is no existing mechanism to indicate the health status of individual nodes. This lack of granularity prevents the scheduler from accurately identifying and handling unhealthy nodes.
- … `HyperNodeStatus.Conditions`.

Describe the solution you'd like
- Add a new field to `Status`, such as `UnhealthyNodeNames`, to explicitly list nodes that are currently unschedulable under the given HyperNode.
- `HyperNodeSystemFailure`: Indicates a system-level issue on the switch or ToR, such as CPU or memory overload, power failure, fan malfunction, or other critical system faults.
- `HyperNodeNetworkUnavailable`: Indicates a network-related issue on the switch or ToR, such as abnormal link status, interface failures, or other network disruptions.

Expected HyperNode Structure
The final HyperNode status should incorporate these enhancements to provide granular node-level health insights and common switch condition types, improving scheduler awareness and resource allocation efficiency.
The final expected HyperNode is as follows:
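
As a rough illustration only (this is not the manifest from the proposal), the status additions described above might be modeled in Go along the following lines. The names `HyperNodeStatus`, `Conditions`, `NodeCount`, `UnhealthyNodeNames`, `HyperNodeSystemFailure`, and `HyperNodeNetworkUnavailable` come from the proposal text. The shape of the `Condition` struct, the JSON tags, the `int64` type for `NodeCount`, and the `SchedulableNodes` helper are assumptions made for this sketch and may differ from the actual API.

```go
package main

import "fmt"

// HyperNodeConditionType names a kind of condition reported on a HyperNode.
type HyperNodeConditionType string

const (
	// HyperNodeSystemFailure: system-level fault on the switch or ToR
	// (CPU or memory overload, power failure, fan malfunction, ...).
	HyperNodeSystemFailure HyperNodeConditionType = "HyperNodeSystemFailure"
	// HyperNodeNetworkUnavailable: network fault on the switch or ToR
	// (abnormal link status, interface failures, ...).
	HyperNodeNetworkUnavailable HyperNodeConditionType = "HyperNodeNetworkUnavailable"
)

// Condition is a simplified stand-in for whatever condition type the real
// API uses; its fields are an assumption of this sketch.
type Condition struct {
	Type    HyperNodeConditionType `json:"type"`
	Status  string                 `json:"status"` // "True", "False", or "Unknown"
	Message string                 `json:"message,omitempty"`
}

// HyperNodeStatus carries the two existing fields named in the proposal
// plus the proposed UnhealthyNodeNames list.
type HyperNodeStatus struct {
	Conditions         []Condition `json:"conditions,omitempty"`
	NodeCount          int64       `json:"nodeCount,omitempty"` // type assumed
	UnhealthyNodeNames []string    `json:"unhealthyNodeNames,omitempty"`
}

// SchedulableNodes is a hypothetical helper: given the member nodes of a
// HyperNode, it returns those not listed in UnhealthyNodeNames, in order.
func SchedulableNodes(members []string, status HyperNodeStatus) []string {
	unhealthy := make(map[string]struct{}, len(status.UnhealthyNodeNames))
	for _, name := range status.UnhealthyNodeNames {
		unhealthy[name] = struct{}{}
	}
	healthy := make([]string, 0, len(members))
	for _, name := range members {
		if _, bad := unhealthy[name]; !bad {
			healthy = append(healthy, name)
		}
	}
	return healthy
}

func main() {
	status := HyperNodeStatus{
		Conditions: []Condition{
			{Type: HyperNodeNetworkUnavailable, Status: "False"},
		},
		NodeCount:          3,
		UnhealthyNodeNames: []string{"node-2"},
	}
	// Prints: [node-1 node-3]
	fmt.Println(SchedulableNodes([]string{"node-1", "node-2", "node-3"}, status))
}
```

Keeping `UnhealthyNodeNames` as a plain list of names, separate from `Conditions`, lets a consumer such as the scheduler skip individual nodes without having to interpret switch-level conditions.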