Commit 0558d24

KEP 5075: update rollback strategy and SLIs
Signed-off-by: Sunyanan Choochotkaew <[email protected]>
1 parent 1d280ea commit 0558d24

1 file changed, 10 additions and 7 deletions
  • keps/sig-scheduling/5075-dra-consumable-capacity

keps/sig-scheduling/5075-dra-consumable-capacity/README.md

Lines changed: 10 additions & 7 deletions
@@ -1072,8 +1072,8 @@ NOTE: Also set `disable-supported` to `true` or `false` in `kep.yaml`.
 -->
 
 Yes, this feature can be disabled once it has been enabled.
-The `allowMultipleAllocations` flag, `sharingPolicy`, `capacityRequests`, and `consumedCapacities` fields will be dropped.
-However, the `shareID` and renamed device (`<device id>/<share id>`) in device status needs to remain to keep the existing allocation result reference valid.
+The `allowMultipleAllocations` flag, `sharingPolicy` and `capacityRequests` fields will be dropped.
+However, the `shareID`, `consumedCapacities`, and renamed device (`<device id>/<share id>`) in device status needs to remain to keep the existing allocation result reference valid.
 
 ###### What happens if we reenable the feature if it was previously rolled back?
 

@@ -1209,16 +1209,19 @@ Existing DRA and related SLOs continue to apply.
 
 ###### What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?
 
-Will consider in the beta timeframe.
-
 <!--
 Pick one more of these and delete the rest.
 -->
 
-- [ ] Metrics
-  - Metric name:
+- [x] Metrics
+  - Metric names:
+    - `apiserver_request` with `resource="resourceclaims", subresource="status"`
+    - `scheduler_plugin_execution_duration_seconds` with `plugin="DynamicResources"`
+      - For state gathering, `extension_point="PreFilter"`
+      - For allocation, `extension_point="Filter"`
+      - For status update, `extension_point="PostFilter"`
   - [Optional] Aggregation method:
-  - Components exposing the metric:
+  - Components exposing the metric: kube-apiserver, kube-scheduler
 - [ ] Other (treat as last resort)
   - Details:
 
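
Reviewer note: the SLI selectors added in this commit could be checked against a scraped `/metrics` payload roughly as follows. This is only a minimal sketch, not part of the KEP. It assumes the Prometheus text exposition format, and it treats `apiserver_request` as a series-name prefix (for example `apiserver_request_total`), since the KEP text does not spell out a full series name. The names `SLI_SELECTORS`, `parse_sample`, and `select_sli_samples` are hypothetical helpers introduced for illustration.

```python
import re

# Selectors copied from the KEP text. ASSUMPTION: the first element is
# matched as a name prefix, because `apiserver_request` is not a complete
# series name on its own.
SLI_SELECTORS = [
    ("apiserver_request",
     {"resource": "resourceclaims", "subresource": "status"}),
    ("scheduler_plugin_execution_duration_seconds",
     {"plugin": "DynamicResources", "extension_point": "PreFilter"}),
    ("scheduler_plugin_execution_duration_seconds",
     {"plugin": "DynamicResources", "extension_point": "Filter"}),
    ("scheduler_plugin_execution_duration_seconds",
     {"plugin": "DynamicResources", "extension_point": "PostFilter"}),
]

# One sample line: name, optional {labels}, then the value.
_SAMPLE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
_LABEL = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_sample(line):
    """Return (name, labels, value) for a sample line, or None otherwise."""
    line = line.strip()
    if not line or line.startswith("#"):  # skip blanks and HELP/TYPE comments
        return None
    match = _SAMPLE.match(line)
    if match is None:
        return None
    name, raw_labels, value = match.groups()
    labels = dict(_LABEL.findall(raw_labels or ""))
    return name, labels, float(value)


def select_sli_samples(text):
    """Keep only the samples that match one of the SLI selectors."""
    selected = []
    for line in text.splitlines():
        sample = parse_sample(line)
        if sample is None:
            continue
        name, labels, _ = sample
        for prefix, wanted in SLI_SELECTORS:
            if name.startswith(prefix) and all(
                labels.get(key) == val for key, val in wanted.items()
            ):
                selected.append(sample)
                break
    return selected
```

In practice the same selection would more likely be written as label matchers in a monitoring query; the sketch only makes explicit which label combinations the new SLI entries refer to.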
