-
Notifications
You must be signed in to change notification settings - Fork 99
Open
Labels
Description
We have issues with pods being killed and rescheduled in busier environments. Unfortunately postgres is just as likely to be killed as any other worker pods. After a discussion with @Fryguy and @jrafanie we think the design should be as follows:
- Add RBAC permissions for the operator to read, list and write
priorityClassNames - Add 3 items to the CRD for high, medium and low priorityClassName values
- Assign class name values as follows:
- If all values are specified in CR, use them
- If no values are set, detect the cluster default. Set low to cluster default, medium = low + 100, high = medium + 100
- Validate that values are reasonable:
- High should not be more than 1,000,000,000 (use CRD JSON schema validation)
- Error if high, medium & low are out of order (code validation)
- Warn if low is less than cluster default? Warn if low is less than 0? (code validation)
- Assign pod priorities:
- High: postgres, memcached, kafka, httpd
- Medium: UI & API, orchestrator, maybe operators if possible (may not work if the class names don't exist yet)
- Low: all other workers
Fryguy
Metadata
Metadata
Assignees
Labels
Type
Projects
Status
To do