Create a new monitor for node-level write load #131560

DiannaHohensee · 2025-07-18T21:03:59Z

Plugs a basic NodeUsageStatsForThreadPoolsMonitor
into the code and sets it up with access to some
components it will need to implement ES-11992.

Relates ES-11991

Plugs a basic NodeUsageStatsForThreadPoolsMonitor into the code and sets it up with access to information it will need to implement ES-11992. Relates ES-11991

elasticsearchmachine · 2025-07-18T21:04:24Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

mhl-b

LGTM with nit naming

mhl-b · 2025-07-21T18:13:05Z

...n/java/org/elasticsearch/cluster/routing/allocation/NodeUsageStatsForThreadPoolsMonitor.java

+ *
+ * TODO (ES-11992): implement
+ */
+public class NodeUsageStatsForThreadPoolsMonitor {


Can be more specific, for example WriteLoadMonitor. I think it should monitor write-load in general, which can be a set of different metrics - write-thread-pool utilization, index/shard write load.

Sure, thanks. I've renamed it WriteLoadConstraintMonitor, to follow the existing naming pattern we have.

NodeUsageStatsForThreadPools has been used generically for the thread pool usage, even though we only care about write load right now, because we'll presumably have search load in future. But I expect any monitoring added for search in future should go into a separate monitor class for maintainability reasons.

cla-checker-service · 2025-07-21T19:01:22Z

❌ Author of the following commits did not sign a Contributor Agreement:
9bcf8ab

Please, read and sign the above mentioned agreement if you want to contribute to this project

mhl-b · 2025-07-21T19:08:46Z

...r/src/main/java/org/elasticsearch/cluster/routing/allocation/WriteLoadConstraintMonitor.java

+ *
+ * TODO (ES-11992): implement
+ */
+public class WriteLoadConstraintMonitor {


We can follow disk allocator naming too, WriteLoadThresholdSettings and WriteLoadThresholdMonitor. I dont see reason for Constraint or Threshold wording here, it's implicit in the name of Monitor. If we monitor something that means there is a value and threshold for this value.

I was hoping to convey the notion of resource constrained shard allocation decisions.

Monitor by itself doesn't imply thresholds or constraints. It says we're watching for something. In the case of WriteLoadMonitor, we're monitoring write load, but we aren't saying for what.

A significant part of this is simply consistency in naming. For example, you can find all the DiskThreshold* files with the same prefix. Similarly here, WriteLoadConstraint*. If we settle on something other than the WriteLoadConstraint* prefix, I'm inclined to put that in a separate PR to rename the other file as well. The new heap decider logic also follows a prefix pattern -- they in fact did a followup PR for renaming.

Went ahead with pushing this so as not to block ES-11992.

Create a new monitor for node-level write load

a1899a3

Plugs a basic NodeUsageStatsForThreadPoolsMonitor into the code and sets it up with access to information it will need to implement ES-11992. Relates ES-11991

DiannaHohensee self-assigned this Jul 18, 2025

DiannaHohensee requested a review from a team as a code owner July 18, 2025 21:04

DiannaHohensee added >non-issue :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) Team:Distributed Coordination Meta label for Distributed Coordination team v9.2.0 labels Jul 18, 2025

[CI] Auto commit changes from spotless

9bcf8ab

DiannaHohensee requested a review from mhl-b July 18, 2025 21:17

mhl-b approved these changes Jul 21, 2025

View reviewed changes

DiannaHohensee added 2 commits July 21, 2025 12:01

rename class

14454c1

Merge branch 'main' into 2025/07/18/node-thread-pool-monitor

19460b5

comment touch up

8bd66fe

DiannaHohensee added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jul 21, 2025

DiannaHohensee mentioned this pull request Jul 21, 2025

(WIP) Collect node thread pool usage for shard balancing #131249

Closed

mhl-b reviewed Jul 21, 2025

View reviewed changes

DiannaHohensee merged commit 79bdc32 into elastic:main Jul 22, 2025
32 of 33 checks passed

DiannaHohensee deleted the 2025/07/18/node-thread-pool-monitor branch July 22, 2025 21:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Create a new monitor for node-level write load #131560

Create a new monitor for node-level write load #131560

DiannaHohensee commented Jul 18, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Jul 18, 2025

Uh oh!

mhl-b left a comment

Uh oh!

mhl-b Jul 21, 2025

Uh oh!

DiannaHohensee Jul 21, 2025

Uh oh!

cla-checker-service bot commented Jul 21, 2025

Uh oh!

mhl-b Jul 21, 2025 •

edited

Loading

Uh oh!

DiannaHohensee Jul 21, 2025

Uh oh!

DiannaHohensee Jul 22, 2025

Uh oh!

Uh oh!

Uh oh!

Create a new monitor for node-level write load #131560

Create a new monitor for node-level write load #131560

Conversation

DiannaHohensee commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Jul 18, 2025

Uh oh!

mhl-b left a comment

Choose a reason for hiding this comment

Uh oh!

mhl-b Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

DiannaHohensee Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

cla-checker-service bot commented Jul 21, 2025

Uh oh!

mhl-b Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DiannaHohensee Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

DiannaHohensee Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

DiannaHohensee commented Jul 18, 2025 •

edited

Loading

mhl-b Jul 21, 2025 •

edited

Loading