Add tokenizer tests and fix auto-compact calculation to account for max_tokens #7342

sestinj · 2025-08-22T18:19:22Z

Summary by cubic

Fix auto-compact to properly account for model max_tokens and prevent context overflow. Add unit tests for tokenizer functions and compaction logic.

Bug Fixes
- Reserve output tokens using maxTokens; default to 35% when unset.
- Compute usage against available input (contextLength - reservedForOutput) and respect the 80% threshold.
- Throw when maxTokens >= contextLength.
- Improve debug logging with input, limits, and decision details.

Tests
- Add tokenizer.test.ts covering message/chat token counts, usage %, and compaction with/without maxTokens, including edge cases.
- Include real-world scenarios (e.g., Claude 200k with 4k max, GPT-4 128k with 4k) and mocks for logger/tokenizer.
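
The budget arithmetic described in the bullets above can be sketched roughly as follows. This is a minimal illustration, not the code from tokenizer.ts: the names `ModelLimits`, `availableInputTokens`, and `shouldCompact` are hypothetical, the rounding of the 35% default and the `>=` comparison at the 80% threshold are assumptions, and the error text is a placeholder.

```typescript
// Hypothetical sketch of the auto-compact budget; names are illustrative only.
interface ModelLimits {
  contextLength: number; // total context window, in tokens
  maxTokens?: number; // optional cap on output tokens
}

const DEFAULT_OUTPUT_RESERVE = 0.35; // share reserved for output when maxTokens is unset
const AUTO_COMPACT_THRESHOLD = 0.8; // share of the input budget that triggers compaction

// Tokens left for input after reserving room for the model's output.
function availableInputTokens(model: ModelLimits): number {
  const reservedForOutput =
    model.maxTokens ??
    Math.round(model.contextLength * DEFAULT_OUTPUT_RESERVE); // rounding is an assumption
  const available = model.contextLength - reservedForOutput;
  if (available <= 0) {
    throw new Error("No room left for input after reserving output tokens.");
  }
  return available;
}

// True once the input uses at least 80% of the available input budget.
function shouldCompact(inputTokens: number, model: ModelLimits): boolean {
  return inputTokens / availableInputTokens(model) >= AUTO_COMPACT_THRESHOLD;
}

// Scenarios in the spirit of the tests; 128_000 for "128k" is an assumption.
const claude: ModelLimits = { contextLength: 200_000, maxTokens: 4_000 };
const gpt4: ModelLimits = { contextLength: 128_000, maxTokens: 4_000 };
console.log(availableInputTokens(claude)); // 196000
console.log(availableInputTokens(gpt4)); // 124000
console.log(shouldCompact(160_000, claude)); // true
```

Under these assumptions, a 200k-context model with a 4k output cap has 196,000 tokens available for input and would compact at around 156,800 tokens, while the same model with no `maxTokens` set would reserve 70,000 tokens and have 130,000 available.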


github-actions · 2025-08-22T18:19:31Z

⚠️ PR Title Format

Your PR title doesn't follow the conventional commit format, but this won't block your PR from being merged. We recommend using this format for better project organization.

Expected Format:

<type>[optional scope]: <description>

Examples:

feat: add changelog generation support
fix: resolve login redirect issue
docs: update README with new instructions
chore: update dependencies

Valid Types:

feat, fix, docs, style, refactor, perf, test, build, ci, chore, revert

This helps with:

📝 Automatic changelog generation
🚀 Automated semantic versioning
📊 Better project history tracking

This is a non-blocking warning - your PR can still be merged without fixing this.
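
For illustration, a title check along these lines could be written as below. This is only a sketch based on the format and type list quoted in the warning; the function name `isConventionalTitle` and the exact regular expression are assumptions, not the bot's actual implementation.

```typescript
// Hypothetical check for "<type>[optional scope]: <description>".
const VALID_TYPES = [
  "feat", "fix", "docs", "style", "refactor",
  "perf", "test", "build", "ci", "chore", "revert",
] as const;

// A type, an optional "(scope)", then ": " and a non-empty description.
const TITLE_PATTERN = new RegExp(
  `^(${VALID_TYPES.join("|")})(\\([^)]+\\))?: .+$`,
);

function isConventionalTitle(title: string): boolean {
  return TITLE_PATTERN.test(title);
}

console.log(isConventionalTitle("fix: resolve login redirect issue")); // true
console.log(
  isConventionalTitle(
    "Add tokenizer tests and fix auto-compact calculation to account for max_tokens",
  ),
); // false
```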

cubic-dev-ai

1 issue found across 3 files

React with 👍 or 👎 to teach cubic. You can also tag @cubic-dev-ai to give feedback, ask questions, or re-run the review.

cubic-dev-ai · 2025-08-22T18:24:38Z

extensions/cli/src/util/tokenizer.ts

+
+  // Ensure we have positive space available for input
+  if (availableForInput <= 0) {
+    throw new Error(`max_tokens is larger than context_length, which should not be possible. Please check your configuration.`);


Error message is misleading when triggered by small context limits with default 35% reservation

Prompt for AI agents

Address the following comment on extensions/cli/src/util/tokenizer.ts at line 140:

<comment>Error message is misleading when triggered by small context limits with default 35% reservation</comment>

<file context>
@@ -117,20 +117,37 @@ export function calculateContextUsagePercentage(
 /**
  * Check if the chat history exceeds the auto-compact threshold
  * @param chatHistory The chat history to check
- * @param modelName The model name
+ * @param model The model configuration
  * @returns Whether auto-compacting should be triggered
  */
 export function shouldAutoCompact(
   chatHistory: ChatCompletionMessageParam[],
</file context>
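
One way to address the reviewer's point is to pick the error text based on which condition actually left no room for input. The sketch below is hypothetical: `describeInputBudgetError` is an invented helper, its wording is illustrative, and it is not necessarily how the follow-up commit resolved the comment.

```typescript
// Hypothetical helper: choose an error message that matches the actual cause.
function describeInputBudgetError(
  contextLength: number,
  maxTokens?: number,
): string {
  if (maxTokens !== undefined && maxTokens >= contextLength) {
    // An explicit max_tokens consumed the whole context window.
    return (
      `max_tokens (${maxTokens}) must be smaller than ` +
      `context_length (${contextLength}). Please check your configuration.`
    );
  }
  // max_tokens was unset, so the default reservation left nothing for input.
  return (
    `context_length (${contextLength}) is too small to leave room for ` +
    `input after reserving output tokens.`
  );
}

console.log(describeInputBudgetError(4_000, 8_000));
console.log(describeInputBudgetError(1));
```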

Add tokenizer tests and fix auto-compact calculation to account for m…

a4f60b9

…ax_tokens

sestinj requested a review from a team as a code owner August 22, 2025 18:19

sestinj requested review from Patrick-Erichsen and removed request for a team August 22, 2025 18:19

github-project-automation bot moved this to Todo in Issues and PRs Aug 22, 2025

github-project-automation bot added this to Issues and PRs Aug 22, 2025

dosubot bot added the size:L label (This PR changes 100-499 lines, ignoring generated files.) Aug 22, 2025

cubic-dev-ai bot reviewed Aug 22, 2025


sestinj added 14 commits August 22, 2025 12:53

Merge branch 'main' into nate-3

1e4b1d6

chore: fix import order

4793da0

fix: address feedback

feefb7d

chore: format

b6cfc7d

fix: jb tests

5955488

fix: pin to ffmpeg version 7.1

5b83e47

fix: truncate Read tool output to avoid context overflow

ee79bff

fix: workflow dispatch default proper type

3585250

fix: vaildate aliases fix core

1000694

fix: auto compaction between tool calls

4c829d7

fix: prune before compacting

04e35c5

chore: format

ca05756

feat: info slash command

decd562

ci: log when tests are re-run

1d915c1

dosubot bot added the size:XXL label (This PR changes 1000+ lines, ignoring generated files.) and removed the size:L label (This PR changes 100-499 lines, ignoring generated files.) Aug 23, 2025

sestinj added 3 commits August 23, 2025 10:50

fix: tests

a93c2bc

chore: format

bb41c54

fix: more tests

d235cb4

sestinj merged commit 7e30a63 into main Aug 23, 2025
80 of 84 checks passed

sestinj deleted the nate-3 branch August 23, 2025 18:14

github-project-automation bot moved this from Todo to Done in Issues and PRs Aug 23, 2025

github-actions bot locked and limited conversation to collaborators Aug 23, 2025
