Model: Seed OSS thinking + tool call support #15552

pwilkin · 2025-08-24T23:53:32Z

This one has been an absolute nightmare to implement (Seed OSS tool calling is basically Qwen Coder all over again), so I hope it actually works. Testing on my Q2_K_S quant, which fails to call the tools properly every second time, has been a nightmare as well.

bfroemel · 2025-08-27T12:12:07Z

fyi, small jinja template update (mostly comment translation, but one functional difference at the end): https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct/commit/497f1dca95ebdec98e41d517b9f060ee753c902f#d2h-526183

pwilkin · 2025-08-27T14:39:17Z

> fyi, small jinja template update (mostly comment translation, but one functional difference at the end): https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct/commit/497f1dca95ebdec98e41d517b9f060ee753c902f#d2h-526183

Thanks, I'll retest with new template.

pwilkin · 2025-08-29T09:22:52Z

Okay, verified: both the tests and a real-life tool-calling run on my trusty Q2_K_S quant pass fine.

@CISC any chance you could take a quick look at this? Apparently the model has gotten popular and some people want to get it merged :>

CISC

LGTM

common/chat.cpp

Co-authored-by: Sigbjørn Skjæret <[email protected]>

pwilkin · 2025-08-29T10:44:01Z

Aight, should be good to go.

ExtReMLapin · 2025-08-29T13:21:11Z

I could be missing something, but your builder.add_rule("root", string_join(tool_rules, " | ")); seems to lack the reasoning grammar.

That means using tool_choice = "required" disables thinking.
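To make the point above concrete, here is a minimal sketch of a root rule that optionally allows a reasoning block before the tool-call alternatives. It is not the code in common/chat.cpp: `join`, `build_root_rule`, and the `reasoning` rule name are hypothetical names chosen for illustration, and the rule bodies themselves are assumed to be defined elsewhere in the grammar.

```cpp
#include <string>
#include <vector>

// Hypothetical helper: joins rule names with a separator.
static std::string join(const std::vector<std::string>& parts, const std::string& sep) {
    std::string out;
    for (size_t i = 0; i < parts.size(); ++i) {
        if (i > 0) {
            out += sep;
        }
        out += parts[i];
    }
    return out;
}

// Hypothetical sketch: builds the right-hand side of a GBNF-style "root" rule.
// With allow_reasoning == false the root accepts only a tool call, so the
// grammar leaves no room for a thinking block. With allow_reasoning == true
// an optional "reasoning" rule (assumed to be defined elsewhere) may precede it.
std::string build_root_rule(const std::vector<std::string>& tool_rules, bool allow_reasoning) {
    const std::string tools = "( " + join(tool_rules, " | ") + " )";
    if (allow_reasoning) {
        return "reasoning? " + tools;
    }
    return tools;
}
```

Under these assumptions, the first form corresponds to the behavior described in the comment (tool calls only), and the second is one way a grammar could keep thinking available when a tool call is required.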

ExtReMLapin · 2025-08-29T13:56:02Z

./build/bin/llama-server -ngl 999 -fa -m Seed-OSS-36B-Instruct-Q5_K_M.gguf -c 131071 --jinja --reasoning-format deepseek --slots --n-predict 131071 --no-context-shift -ctk q8_0 -ctv q8_0 --parallel 1 --port 2483 --host 192.168.25.25 --chat-template-file models/templates/ByteDance-Seed-OSS.jinja

Gave it a quick try. Tool calling results are terrible even with a very simple example, but that's probably the model's fault.

Gave it another try with a tool we use and a long context, and something is really off:

* I feel like it's not streaming correctly: it's sending the whole tool call at once.
* There is something weird going on with reasoning (it shows up after the tool call ???).

With tool_choice set to required we should get reasoning_content and no content, and the reasoning should come BEFORE the tool call.
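For context on the streaming observation, a server that streams tool calls typically sends only the newly generated suffix of the arguments on each update. The sketch below shows that idea under the assumption that each partial parse must extend the previous one; `stream_delta` is a hypothetical name, not a llama.cpp function.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical sketch: returns the part of `current` that was not yet sent,
// assuming `current` must start with everything in `previous`.
std::string stream_delta(const std::string& previous, const std::string& current) {
    if (previous.empty()) {
        return current;
    }
    // The prefix check fails if the partial parse changed text that was already streamed.
    if (current.compare(0, previous.size(), previous) != 0) {
        throw std::runtime_error("invalid diff: previous text is not a prefix of current text");
    }
    return current.substr(previous.size());
}
```

If the tool-call arguments only become available once the call is complete, the first delta contains the whole call, which would look like the "whole tool call at once" behavior described above.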

To run the same query you can use node and this file; note that you might need multiple tries:
oss_js_node_debug.js

pwilkin · 2025-08-29T14:21:25Z

Why do the meaningful testing comments always start just as the PR gets merged? 😆

@ExtReMLapin I noticed this type of behavior too, but I'm not sure if this is a mistake or whether this is an intended feature of tool calls happening within the reasoning, which would mean we'd have to rework the grammar.

I've only tried tool calling on my Q2_K_S, but it works without any problems, at least on the simple tool calls I've tried (web search / execute shell command).

ExtReMLapin · 2025-08-29T14:26:50Z

> Why do the meaningful testing comments always start just as the PR gets merged? 😆

Because I'm a lazy man and pulling it on my own branch is already too much effort!

As for the reasoning, I'm surprised it happens at all. With Qwen on master right now it doesn't happen, because the grammar doesn't allow it (in required tool calling mode).
If you want an example, I opened a PR to support thinking AND tool calling with tool_choice at required: #15248

> I've only tried tool calling on my Q2_K_S, but it works without any problems,

I don't see why everyone seems so happy with this model; from my tests it's VERY VERY meh, nowhere close to Qwen3.

Maybe I'm just doing something wrong, but either way we should NOT see the thinking tags. It should either not think, or parse the thinking tags and shove the text into reasoning_content.
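A minimal sketch of the second option (moving the text between thinking tags into reasoning_content) could look like the following. It is an illustration only, not the parser in common/chat.cpp: `ParsedMessage` and `split_reasoning` are hypothetical names, and the tags are passed in as parameters because the exact tag strings are not given in this thread.

```cpp
#include <string>

// Hypothetical result type mirroring the two response fields discussed above.
struct ParsedMessage {
    std::string reasoning_content;
    std::string content;
};

// Hypothetical sketch: moves the text between the first open/close tag pair
// into reasoning_content and keeps everything else as content.
ParsedMessage split_reasoning(const std::string& raw,
                              const std::string& open_tag,
                              const std::string& close_tag) {
    ParsedMessage msg;
    const size_t open = raw.find(open_tag);
    if (open == std::string::npos) {
        msg.content = raw;  // no thinking block at all
        return msg;
    }
    const size_t start = open + open_tag.size();
    const size_t close = raw.find(close_tag, start);
    if (close == std::string::npos) {
        // Unterminated block (e.g. generation still in progress):
        // treat the rest of the text as reasoning.
        msg.content = raw.substr(0, open);
        msg.reasoning_content = raw.substr(start);
        return msg;
    }
    msg.reasoning_content = raw.substr(start, close - start);
    msg.content = raw.substr(0, open) + raw.substr(close + close_tag.size());
    return msg;
}
```

With this split, the tags themselves never reach the client: the reasoning text lands in reasoning_content and only the remaining text is returned as content.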

pwilkin · 2025-08-29T14:30:35Z

> Maybe i'm just doing something wrong, but either way we should NOT see the thinking tags, it should either not think or parse the thinking tags and shove the text into reasoning_content

True. Are you using the official updated template (from https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct/blob/main/chat_template.jinja)?

ExtReMLapin · 2025-08-29T16:15:44Z

As you can see in the command I posted, I'm using the one you pushed in your PR: --chat-template-file models/templates/ByteDance-Seed-OSS.jinja

* Reasoning and tool-calling support for Seed OSS
* Fix grammar and partial parsing
* Whitespace
* New chat template
* Update common/chat.cpp
  Co-authored-by: Sigbjørn Skjæret <[email protected]>
* Update common/chat.cpp
  Co-authored-by: Sigbjørn Skjæret <[email protected]>
* Remove unused 'purge_healing_marker' helper

Co-authored-by: Sigbjørn Skjæret <[email protected]>

pwilkin added 2 commits August 24, 2025 23:04

Reasoning and tool-calling support for Seed OSS

f7638ea

Fix grammar and partial parsing

50f7179

github-actions bot added the testing Everything test related label Aug 24, 2025

Whitespace

f1b7e46

pwilkin added 2 commits August 29, 2025 10:27

Merge branch 'ggml-org:master' into seed-oss-thinking

88018a6

New chat template

1083a79

CISC approved these changes Aug 29, 2025


pwilkin and others added 3 commits August 29, 2025 12:42

Update common/chat.cpp

5a52138

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update common/chat.cpp

3b2acb0

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Remove unused 'purge_healing_marker' helper

8de13d6

CISC merged commit 60e5eee into ggml-org:master Aug 29, 2025
47 of 48 checks passed

pwilkin mentioned this pull request Sep 1, 2025

Eval bug: Granite 4.0 Invalid diff: '<|tool_call|>["1025202362"]' not found at start of '<|tool_call|>["1350490027"]' #15713

Open
