[ML] Flag updates from Inference #131725


Status: Open. Wants to merge 6 commits into `main`.
Conversation

@prwhelan (Member) commented Jul 22, 2025

Flag updates from Inference so Serverless can detect them.
Swap tests to set adaptive allocations rather than num allocations to pass in serverless.

@prwhelan added labels on Jul 22, 2025: >test (Issues or PRs that are addressing/adding tests), :ml (Machine learning), Team:ML (Meta label for the ML team), v9.2.0
@elasticsearchmachine added label on Jul 22, 2025: serverless-linked (Added by automation, don't add manually)
@prwhelan marked this pull request as ready for review July 22, 2025 18:55
@elasticsearchmachine (Collaborator) commented:

Pinging @elastic/ml-core (Team:ML)

@jan-elastic (Contributor) left a comment:

LGTM

@prwhelan changed the title from "[ML] Use adaptive allocations in test" to "[ML] Flag updates from Inference" on Jul 23, 2025
@prwhelan prwhelan requested a review from jan-elastic July 23, 2025 20:54

```java
public void setFromInference(boolean fromInference) {
    this.fromInference = fromInference;
    this.isInternal = fromInference;
```

(Contributor) commented:

It looks confusing that setFromInference also sets isInternal.

```diff
@@ -27,6 +27,7 @@
 import java.io.IOException;
 import java.util.Objects;
```

(Contributor) commented:

I'm missing a bit of context: why do we need to distinguish between these cases?

(Contributor) commented:

Is there a corresponding Serverless PR?

@prwhelan (Member, Author) replied:

Yeah, let me ping you with the internal documentation.

@prwhelan (Member, Author) replied:

> I'm missing a bit of context: why do we need to distinguish between these cases?

We need to allow updates to num_allocations in serverless that originate from the AdaptiveAllocationsScalerService (ADAPTIVE_ALLOCATIONS), but we want to disallow updates from users (API and INFERENCE). The only alternative I thought of was refactoring AdaptiveAllocationsScalerService to update directly rather than through the API, but that felt more intrusive.
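The source split described above can be sketched as a small enum plus two predicates. This is an illustrative sketch only: the names `SourceDemo`, `allowedInServerless`, and `isInternal` are assumptions for demonstration, not the actual Elasticsearch code.

```java
// Hypothetical sketch of the update-source distinction discussed in this PR.
// An allocations update can originate from the REST API, the inference update
// API, or the adaptive allocations autoscaler.
public class SourceDemo {
    public enum Source { API, INFERENCE, ADAPTIVE_ALLOCATIONS }

    // Serverless should only accept num_allocations updates that come from
    // the adaptive allocations scaler, never from users (API or INFERENCE).
    public static boolean allowedInServerless(Source source) {
        return source == Source.ADAPTIVE_ALLOCATIONS;
    }

    // Preserves the old boolean semantics: both the inference update API and
    // the autoscaler were previously flagged as "internal" (boolean true).
    public static boolean isInternal(Source source) {
        return source == Source.INFERENCE || source == Source.ADAPTIVE_ALLOCATIONS;
    }

    public static void main(String[] args) {
        System.out.println(allowedInServerless(Source.ADAPTIVE_ALLOCATIONS)); // true
        System.out.println(allowedInServerless(Source.INFERENCE));            // false
        System.out.println(isInternal(Source.API));                          // false
    }
}
```

The key point is that the two predicates partition the enum differently: serverless gating needs the three-way distinction, while the legacy "internal" flag only ever saw two buckets.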

```java
// we changed over from a boolean to an enum
// when it was a boolean, true came from adaptive allocations and false came from the rest api
// treat "inference" as if it came from the api
out.writeBoolean(isInternal());
```

(Contributor) commented:

Do we need to determine if source == Source.ADAPTIVE_ALLOCATIONS here, since this will return true for Source.INFERENCE as well?

@prwhelan (Member, Author) replied:

Previously, we set the boolean to true if the source was either the inference update API or the adaptive allocations autoscaler. out.writeBoolean(isInternal()) preserves this logic (I think). It means the stream reader will think an inference API call is an adaptive allocations call, but that only affects serverless, which is only a mixed cluster during a rolling update.
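The backward-compatibility trade-off in that reply can be made concrete with a round-trip sketch. This is a hedged illustration, not the real serialization code: `toLegacyBoolean` and `fromLegacyBoolean` are hypothetical names standing in for the old-wire-format write and read paths.

```java
// Sketch of the legacy-boolean flattening discussed above. Writing to an
// older node collapses the enum to the old boolean; reading it back cannot
// distinguish INFERENCE from ADAPTIVE_ALLOCATIONS, so the distinction is
// lost for the duration of a mixed-version (rolling upgrade) cluster.
public class BwcDemo {
    public enum Source { API, INFERENCE, ADAPTIVE_ALLOCATIONS }

    // old wire format: true meant "internal" (inference API or autoscaler)
    public static boolean toLegacyBoolean(Source source) {
        return source == Source.INFERENCE || source == Source.ADAPTIVE_ALLOCATIONS;
    }

    // lossy inverse: true always maps back to ADAPTIVE_ALLOCATIONS
    public static Source fromLegacyBoolean(boolean internal) {
        return internal ? Source.ADAPTIVE_ALLOCATIONS : Source.API;
    }

    public static void main(String[] args) {
        // an INFERENCE update round-tripped through an old node comes back
        // looking like an autoscaler update
        System.out.println(fromLegacyBoolean(toLegacyBoolean(Source.INFERENCE)));
        // prints ADAPTIVE_ALLOCATIONS
    }
}
```

The lossy mapping is acceptable here precisely because, as the author notes, it only matters while a serverless cluster is mixed-version during a rolling update.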

```diff
@@ -119,11 +131,15 @@ public void setAdaptiveAllocationsSettings(AdaptiveAllocationsSettings adaptiveA
 }

 public boolean isInternal() {
-    return isInternal;
+    return source == Source.INFERENCE || source == Source.ADAPTIVE_ALLOCATIONS;
```

(Contributor) commented:

Can you confirm that we do want Source.INFERENCE here for all the usages of isInternal() below?

@prwhelan (Member, Author) replied:

Confirmed! Yeah, the inference update code previously set isInternal to true (back when the boolean existed).

Labels: :ml (Machine learning), serverless-linked (Added by automation, don't add manually), Team:ML (Meta label for the ML team), >test (Issues or PRs that are addressing/adding tests), v9.2.0