Skip to content

Conversation

IzzyPutterman
Copy link
Contributor

No description provided.

Copy link
Contributor

@matthewkotila matthewkotila Sep 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@IzzyPutterman is this PR still something we want merged?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes I think we do. We can query the trtllm backend with this, when hosted by triton.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same question for Izzy a few months later. Is this feature already in with the direct C-API TRT-LLM engine endpoint-type? If so, we could close out this PR.

CC: @debermudez

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants