Add served-model support to serving endpoint query #1044
+17
−2
What changes are proposed in this pull request?
This PR adds support for querying an individual model behind a serving endpoint, as documented here.
It adds `served-models` to the URL used for query requests if `served_model_name` is provided.
While querying an individual model behind a serving endpoint is supported by the REST API, it is not supported by the SDK. This limits users' flexibility in serving multiple models from one endpoint. I raised this problem in this issue.
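To illustrate the URL change described above, here is a minimal sketch of how the query path might be built. The helper name `build_query_path` is hypothetical and is not the SDK's actual code; the path layout (`/serving-endpoints/{name}/invocations`, with an extra `served-models/{served_model_name}` segment) is an assumption based on the REST API behavior this PR refers to.

```python
from typing import Optional
from urllib.parse import quote


def build_query_path(name: str, served_model_name: Optional[str] = None) -> str:
    """Hypothetical helper: return the request path for a serving endpoint query.

    Assumed layout (not taken from the SDK source):
      /serving-endpoints/{name}/invocations
      /serving-endpoints/{name}/served-models/{served_model_name}/invocations
    """
    # Percent-encode each segment so unusual characters cannot alter the path.
    base = f"/serving-endpoints/{quote(name, safe='')}"
    if served_model_name:
        # Route the request to one specific served model behind the endpoint.
        model = quote(served_model_name, safe="")
        return f"{base}/served-models/{model}/invocations"
    # Default: let the endpoint decide which served model handles the request.
    return f"{base}/invocations"
```

When `served_model_name` is omitted, the path is unchanged from the existing behavior, so callers that do not pass the new argument should be unaffected.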
How is this tested?
I tested this with an endpoint in a private Databricks workspace.
Describe any tests you have done, especially if the tests are not part of
the unit tests (e.g. local tests).
I couldn't find any test cases for the query methods, so the answer is N/A.