Feat: Add Optional Structured Session Metadata #1474

habema · 2025-08-14T13:04:28Z

Resolves #1385

This PR introduces an optional structured storage mode to SQLiteSession to improve the observability and query-ability of conversation histories.

When a user initializes a session with structured=True, two new tables are created and populated alongside the existing raw JSON log:

agent_conversation_messages: Stores distinct user, assistant, and system messages.
agent_tool_calls: Records tool call invocations and their corresponding outputs.

This makes it significantly easier to analyze and debug agent interactions using standard SQL, addressing the limitations of querying JSON blobs.

Key points:

The feature is opt-in. Existing implementations are unaffected.
Foreign key constraints ensure data integrity when items are popped or sessions are cleared.
Includes comprehensive unit tests and updated documentation.

…tool calls

habema · 2025-08-19T13:35:50Z

Switching this back to a draft as it needs some reviewing

habema · 2025-08-20T10:51:16Z

In true developer fashion, I might have dived a bit too deep. Yet, this all would be very beneficial to my current project.

The problem with my earlier proposed schema is that it lacks a fundamental feature, the ability to connect raw events to each other (user message to assistant message, tool calls to the invoking user message or resulting assistant message(s), usage, etc.), very similarly to tracing functionality.

To sum up the goal, its to make mimic tracing functionality in a structured and query-able manner.

Added spans and linkage
- agent_conversation_messages: added parent_raw_event_id, trace_id, span_id. User rows keep the agent span; assistant rows are now attributed to the model’s generation/response span.
- agent_tool_calls: added trace_id, span_id.
Introduced agent_usage to record per‑response usage: model, requests, token counts, details, and trace_id/span_id. Indexed by (trace_id, created_at) and response_id.
Configuration
- The opt‑in flag is now structured_metadata=True (formerly structured=True).
- Existing non‑structured behavior is unchanged.
Necessary tests, docs, and demo are included.

This keeps the feature opt‑in while making stored conversations, tool calls, and usage easy to analyze with SQL, and ensures accurate span attribution for observability.

seratch · 2025-08-26T04:37:31Z

Thanks for sending this PR. I quickly checked the changes and felt the current changes in this PR make the SDK internals way more complex, so I am a bit hesitant to have this.

habema · 2025-08-26T08:35:47Z

Thank you for the feedback. I understand the concern about complexity.

To clarify:

Is the core idea of structured session metadata (making conversations queryable via SQL) welcome, but the current implementation too complex? If so, I'd be happy to look into alternative approaches. Perhaps just the basic structured tables (messages, tool calls, usage) without the tracing ingestion, or a implement this as an extension to separate it from the SDK internals.
Or is the core concept itself considered too complex for the SDK? If the tracing integration is the main concern, I could remove that entirely and focus just on the basic structured storage tables.

As mentioned earlier.

To sum up the goal, its to make mimic tracing functionality in a structured and query-able manner.

For my usecase, exact trace_ids and span_ids are not important, I just want to be able to connect every full interaction and its usage.

seratch · 2025-08-26T12:21:59Z

Is the core idea of structured session metadata (making conversations queryable via SQL) welcome, but the current implementation too complex?

Yes, it is. As we discussed at the issue, providing an option to effectively use relational database schema is totally fine but I don't think we need tracing integration this time. We already have sessions feature, so I was assuming that we can have yet another solution for the same purpose.

a implement this as an extension to separate it from the SDK internals.

I haven't checked how this can be at all, but if you have an idea like this in your mind, this could be more clean, plus enhancing the layer could be easier for users too.

habema and others added 3 commits August 14, 2025 15:55

implement structured storage with additional tables for messages and …

3e415c0

…tool calls

fix mypy

2e450b3

Merge branch 'main' into feat/enhanced-session-schema

a9e1eb1

seratch added documentation Improvements or additions to documentation enhancement New feature or request feature:sessions labels Aug 14, 2025

habema added 2 commits August 16, 2025 04:04

Merge branch 'main' into feat/enhanced-session-schema

3419ae7

Merge branch 'main' into feat/enhanced-session-schema

acd3ada

habema marked this pull request as draft August 19, 2025 13:35

habema and others added 4 commits August 20, 2025 13:23

structured metadata revised implementation with docs and demo

ded4aa0

Merge branch 'main' into feat/enhanced-session-schema

374a1fb

remove traces of local testing migration logic

f676420

cleanup

1e0410c

habema marked this pull request as ready for review August 20, 2025 10:49

habema changed the title ~~Feat: Add Optional Structured Session Storage~~ Feat: Add Optional Structured Session Metadata Aug 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat: Add Optional Structured Session Metadata #1474

Feat: Add Optional Structured Session Metadata #1474

Uh oh!

habema commented Aug 14, 2025 •

edited

Loading

Uh oh!

habema commented Aug 19, 2025

Uh oh!

habema commented Aug 20, 2025 •

edited

Loading

Uh oh!

seratch commented Aug 26, 2025

Uh oh!

habema commented Aug 26, 2025

Uh oh!

seratch commented Aug 26, 2025

Uh oh!

Uh oh!

Feat: Add Optional Structured Session Metadata #1474

Are you sure you want to change the base?

Feat: Add Optional Structured Session Metadata #1474

Uh oh!

Conversation

habema commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

habema commented Aug 19, 2025

Uh oh!

habema commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seratch commented Aug 26, 2025

Uh oh!

habema commented Aug 26, 2025

Uh oh!

seratch commented Aug 26, 2025

Uh oh!

Uh oh!

habema commented Aug 14, 2025 •

edited

Loading

habema commented Aug 20, 2025 •

edited

Loading