GH-3657: Fix DeepSeek tool call content null issue #3817

Acacian · 2025-07-14T11:53:34Z

✨ Summary

This PR resolves the issue where DeepSeek chat completions with tool calls return an assistant message with content = null, leading to schema validation errors or downstream failures (e.g., in sglang or FastAPI with pydantic).

To address this, we explicitly set the assistant message content to "__tool_call__" when tool calls are present but content is null.

🛠 Changes

In DeepSeekChatModel, updated buildGeneration(...) logic:
- If content == null && toolCalls != empty, set content = "__tool_call__" to maintain compatibility.
This change preserves downstream contract expectations, especially for clients expecting content to be a non-null string.
No changes to the base AssistantMessage#getText() logic, keeping it backward compatible for other models.

💡 Context

DeepSeek sometimes returns:

{
  "role": "assistant",
  "content": null,
  "tool_calls": [...]
}

This results in downstream consumers (e.g., sglang, FastAPI with pydantic) throwing errors:

ChatCompletionMessageGenericParam.content Field required [type=missing]

By ensuring a placeholder content ("__tool_call__"), we:

Avoid null pointer or validation errors
Make the message displayable in logs or UIs
Align behavior with other models that always return non-null content

🧪 Tests & Verification

✅ Added unit test: ChatClientResponseTests.whenAssistantMessageHasOnlyToolCalls_thenContentIsToolCallMarker
✅ Validated getText() returns "__tool_call__" when expected
✅ Verified compatibility with Generation.getOutput().getText()
✅ All existing tests passed (./mvnw clean verify)
✅ Checkstyle passed (-Ddisable.checks=false)
✅ Apache license headers added
✅ DCO signed
✅ Linear commit history (git log --oneline --graph confirmed)
✅ Rebased on latest main

📘 Documentation

JavaDoc and inline comments added where applicable to clarify the fallback behavior.
No changes required to public APIs or developer-facing contracts.

🔗 Closes #3657

Please review and let me know if you’d like any refinements.
Happy to iterate!

Closes spring-projects#3657 * Add content fallback for DeepSeek when only tool calls are present * Ensures AssistantMessage has proper output for downstream processing * Add test case for AssistantMessage with tool calls only Signed-off-by: Dongha Koo <[email protected]>

...ai-client-chat/src/test/java/org/springframework/ai/chat/client/ChatClientResponseTests.java

dev-jonghoonpark · 2025-07-14T12:19:22Z

Please update the copyright end year from 2024 to 2025.

…t/client/ChatClientResponseTests.java Co-authored-by: jonghoonpark <[email protected]> Signed-off-by: 구동하 <[email protected]>

Closes spring-projects#3657 * Set content = "__tool_call__" when content is null and toolCalls are present * Prevents schema validation errors in downstream clients * Add unit test: ChatClientResponseTests.whenAssistantMessageHasOnlyToolCalls_thenContentIsToolCallMarker * Apply code style: import formatting, copyright year Signed-off-by: Dongha Koo <[email protected]>

Acacian · 2025-07-14T14:09:11Z

@dev-jonghoonpark
Thanks for the reminder regarding the import formatting and copyright year — I've made the changes accordingly.
I'll be more mindful of the style guidelines going forward. Appreciate your review!

mxsl-gr · 2025-07-17T02:40:07Z

I don’t think this is an issue with Spring AI, it seems more like a problem with sglang

mxsl-gr · 2025-07-17T03:00:12Z

I'm not very familiar with Python and Pydantic, since my main programming languages are Kotlin, Java, TypeScript, and Go.

But maybe changing this line in sglang ChatCompletionMessageGenericParam:content to content: Optional[Union[str, List[ChatCompletionMessageContentTextPart], None]] might work.

Acacian · 2025-07-17T06:52:55Z

@mxsl-gr
Thanks again for the discussion!

You're right to consider that — but in fact, the field is already defined as
content: Union[str, List[ChatCompletionMessageContentTextPart], None],
which is semantically equivalent to Optional[...]. So None is already accepted by the type system.

However, this doesn’t resolve the issue, because the validation error doesn’t originate from the type annotation itself —
it occurs downstream, in tools like OpenAPI schema generators or FastAPI/Pydantic,
where null values may still be interpreted as invalid or required, depending on how the schema is rendered or consumed.

Acacian · 2025-07-17T07:15:30Z

Just to clarify why this is still relevant for Spring AI:

Even if the root cause originates in sglang, the validation failure (e.g., content: null causing OpenAPI schema errors or FastAPI crashes) is surfaced through Spring AI's API layer — such as .getOutput().getText() or generated client interfaces.

But more importantly, Spring AI is not just a passive proxy of the upstream response — it actively maps, serializes, and exposes model output to downstream clients. This includes constructing OpenAPI schemas, wrapping tool call messages, and integrating with frameworks like Spring Web or WebFlux.

Therefore, Spring AI shares responsibility for ensuring that the responses it produces are schema-compliant and consumer-friendly — regardless of upstream behavior. Even if upstream systems are improved later, Spring AI should guard against breakage in the current ecosystem.

This PR addresses the issue at the precise point of failure and control, and does so in a backward-compatible, non-invasive way — stabilizing behavior for all downstream consumers that rely on consistent content output.

dev-jonghoonpark reviewed Jul 14, 2025

View reviewed changes

...ai-client-chat/src/test/java/org/springframework/ai/chat/client/ChatClientResponseTests.java Outdated Show resolved Hide resolved

Acacian and others added 2 commits July 14, 2025 22:20

Update spring-ai-client-chat/src/test/java/org/springframework/ai/cha…

47f2ba8

…t/client/ChatClientResponseTests.java Co-authored-by: jonghoonpark <[email protected]> Signed-off-by: 구동하 <[email protected]>

Acacian force-pushed the GH-3657-deepseek-toolcall-fix branch from 55067cc to 4b054b0 Compare July 14, 2025 14:04

markpollack added the deepseek label Jul 15, 2025

ilayaperumalg added this to the 1.1.x milestone Jul 16, 2025

ilayaperumalg added the for: backport-to-1.0.x label Jul 16, 2025

ilayaperumalg self-assigned this Jul 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GH-3657: Fix DeepSeek tool call content null issue #3817

GH-3657: Fix DeepSeek tool call content null issue #3817

Acacian commented Jul 14, 2025

Uh oh!

Uh oh!

dev-jonghoonpark commented Jul 14, 2025

Uh oh!

Acacian commented Jul 14, 2025

Uh oh!

mxsl-gr commented Jul 17, 2025

Uh oh!

mxsl-gr commented Jul 17, 2025 •

edited

Loading

Uh oh!

Acacian commented Jul 17, 2025 •

edited

Loading

Uh oh!

Acacian commented Jul 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

GH-3657: Fix DeepSeek tool call content null issue #3817

Are you sure you want to change the base?

GH-3657: Fix DeepSeek tool call content null issue #3817

Conversation

Acacian commented Jul 14, 2025

✨ Summary

🛠 Changes

💡 Context

🧪 Tests & Verification

📘 Documentation

🔗 Closes #3657

Uh oh!

Uh oh!

dev-jonghoonpark commented Jul 14, 2025

Uh oh!

Acacian commented Jul 14, 2025

Uh oh!

mxsl-gr commented Jul 17, 2025

Uh oh!

mxsl-gr commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Acacian commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Acacian commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mxsl-gr commented Jul 17, 2025 •

edited

Loading

Acacian commented Jul 17, 2025 •

edited

Loading

Acacian commented Jul 17, 2025 •

edited

Loading