fix(bedrock): Add prompt caching support for Converse API #3390
Conversation
Caution

Changes requested ❌

Reviewed everything up to 4fa3792 in 2 minutes and 2 seconds.
- Reviewed 156 lines of code in 3 files.
- Skipped 0 files when reviewing.
- Skipped posting 2 draft comments; they appear below.
1. packages/opentelemetry-instrumentation-bedrock/opentelemetry/instrumentation/bedrock/__init__.py:359
   - Draft comment: Good integration of prompt_caching_converse_handling in _handle_converse. In the streaming handler (around line 400), if both read and write tokens are present, the span attribute may be overwritten; ensure this is the intended behavior.
   - Reason this comment was not posted: Comment was on unchanged code.
2. packages/opentelemetry-instrumentation-bedrock/tests/metrics/test_bedrock_converse_prompt_caching_metrics.py:56
   - Draft comment: The test correctly validates prompt caching metrics for the Converse API. The cumulative workaround for metric values indicates the underlying counter is cumulative. Consider resetting metrics between tests to avoid cross-test interference if possible.
   - Reason this comment was not posted: Decided after close inspection that this draft comment was likely wrong and/or not actionable (usefulness confidence = 10% vs. threshold = 50%). The comment has two parts: an observation about the cumulative nature of the metrics, which is already documented in the code comments, and a speculative suggestion about resetting metrics. The suggestion could be valid if there were evidence of cross-test interference, but none is present; the current workaround is intentional, documented, and functional. Since the comment identifies no actual problem or required change, it was deleted as primarily observational.
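The "cumulative workaround" the draft comment refers to is a common pattern with OpenTelemetry-style counters: a Counter only accumulates, so a test that runs after other tests must assert against the running total rather than the per-call increment. The sketch below illustrates the idea with a toy counter; it is an illustration of the pattern, not the project's actual test code.

```python
class CumulativeCounter:
    """Toy stand-in for a metrics counter that only accumulates."""

    def __init__(self):
        self.value = 0

    def add(self, amount: int):
        self.value += amount


counter = CumulativeCounter()

# A first test records 10 cached tokens...
counter.add(10)
assert counter.value == 10

# ...a later test records 5 more, but a cumulative reader still reports
# the running total, so assertions must account for earlier tests.
counter.add(5)
assert counter.value == 15  # not 5
```

This is why the test file computes expected metric values cumulatively instead of expecting each call's delta in isolation.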
…nstrumentation/bedrock/prompt_caching.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
hey @AlanPonnachan - looks like tests are failing, can you take a look?
….com/AlanPonnachan/openllmetry into feat-bedrock-converse-prompt-caching
Hi @nirga, I've resolved the lint test failures. The remaining failing test requires a recorded API response, and as I don't have access to an active AWS account, I'm unable to generate the cassette it needs. Thanks for your help!
Sure @AlanPonnachan, will do it - can you fix the small comment I wrote? I'll then run it locally and record a test. BTW - if you can rely on existing converse tests it might be easier
Thanks for the great suggestion and for your willingness to help record the test! I agree that relying on an existing test is a cleaner approach. Before I push the changes, I just want to confirm my plan sounds good to you: I'll extend an existing converse test so its cassette can cover the new prompt caching assertions. Does this plan look good? If so, I'll go ahead and make the changes.
Description
This PR introduces prompt caching telemetry for the AWS Bedrock Converse and Converse Stream APIs, bringing feature parity with the existing `invoke_model` instrumentation.

The Converse API reports caching information in the `usage` field of the response body, rather than through HTTP headers. This implementation adds the necessary logic to parse this information and record it as metrics and span attributes.

Changes include:
- Added `prompt_caching_converse_handling` in `prompt_caching.py` to extract `cache_read_input_tokens` and `cache_creation_input_tokens` from the response body.
- `__init__.py`: The new function is now called from `_handle_converse` and `_handle_converse_stream` to process caching data for both standard and streaming calls.
- Added `test_bedrock_converse_prompt_caching_metrics.py` to validate that the `gen_ai.prompt.caching` metric is correctly emitted for the Converse API.

Fixes #3337
Important

Adds prompt caching telemetry for AWS Bedrock Converse APIs, including a new function for caching data extraction and corresponding tests.
- Added `prompt_caching_converse_handling` in `prompt_caching.py` to extract caching data from the Converse API response body.
- Integrated `prompt_caching_converse_handling` into `_handle_converse` and `_handle_converse_stream` in `__init__.py`.
- Added `test_bedrock_converse_prompt_caching_metrics.py` to validate `gen_ai.prompt.caching` metric emission for the Converse API.