
Conversation

@aperepel commented Oct 8, 2025

Summary

Fixes #3109

This PR ensures that the finish_reason field from LiteLLM responses is properly propagated to LlmResponse objects, enabling after_model_callback functions to detect completion conditions like max_tokens truncation.

Changes

  1. src/google/adk/models/lite_llm.py

    • Modified _model_response_to_generate_content_response() to extract finish_reason from LiteLLM response
    • Sets llm_response.finish_reason when present in the response (see the sketch after this list)
  2. src/google/adk/telemetry/tracing.py

    • Updated trace_call_llm() to handle both enum (Gemini) and string (LiteLLM) finish_reason values
    • Uses hasattr() check to detect enum vs string type
  3. tests/unittests/models/test_litellm.py

    • Added 4 comprehensive unit tests covering different finish_reason scenarios
    • Tests cover: "length", "stop", "tool_calls", and "content_filter"
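
A minimal sketch of the propagation described in item 1 above (not the literal lite_llm.py code; it assumes LiteLLM's OpenAI-style response.choices[0].finish_reason layout):

```python
# Hedged sketch, not the actual ADK implementation.
def propagate_finish_reason(model_response, llm_response):
    choices = getattr(model_response, "choices", None) or []
    if choices:
        finish_reason = getattr(choices[0], "finish_reason", None)
        if finish_reason:
            # At this stage of the PR the raw string ("length", "stop", ...)
            # is stored as-is; later commits map it to a FinishReason enum.
            llm_response.finish_reason = finish_reason
    return llm_response
```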

Test Results

All tests pass:

  • ✅ 53/53 tests in test_litellm.py pass
  • ✅ All 4 new finish_reason tests pass
  • ✅ No existing tests broken

Impact

This fix allows after_model_callback functions to properly detect:

  • "length": max_tokens limit reached
  • "stop": natural completion
  • "tool_calls": tool invocations
  • "content_filter": filtered content

This enables implementing retry logic for incomplete responses, logging completion statistics, and handling different completion conditions appropriately.
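
For illustration only (not code from this PR), a callback using the propagated field might look like the sketch below; the (callback_context, llm_response) signature and the google.genai FinishReason members are assumptions based on the ADK callback API.

```python
# Hypothetical after_model_callback; names and signature are assumptions.
import logging
from typing import Optional

from google.genai import types  # assumed home of the FinishReason enum

def check_truncation(callback_context, llm_response) -> Optional[object]:
    reason = llm_response.finish_reason
    if reason in (types.FinishReason.MAX_TOKENS, "length"):
        logging.warning("Output truncated by max_tokens; consider retrying "
                        "with a larger limit.")
    elif reason in (types.FinishReason.SAFETY, "content_filter"):
        logging.warning("Output was blocked or filtered by the provider.")
    # Returning None keeps the original LlmResponse unchanged.
    return None
```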

Fixes google#3109

This change ensures that the finish_reason field from LiteLLM responses
is properly propagated to LlmResponse objects, enabling callbacks to
detect completion conditions like max_tokens truncation.

Changes:
- Extract finish_reason from LiteLLM response in lite_llm.py
- Update tracing.py to handle both enum (Gemini) and string (LiteLLM)
  finish_reason values (sketched after this commit message)
- Add comprehensive unit tests for finish_reason propagation

The fix allows after_model_callback functions to properly detect:
- "length": max_tokens limit reached
- "stop": natural completion
- "tool_calls": tool invocations
- "content_filter": filtered content

Summary of Changes

Hello @aperepel, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This PR addresses issue #3109 by implementing the propagation of the finish_reason field from LiteLLM responses to LlmResponse objects. This allows downstream functions, like after_model_callback, to correctly interpret why a model response concluded. The changes include modifications to the LiteLLM model integration to extract this field, updates to the telemetry tracing to handle different finish_reason types (enum vs. string), and the addition of comprehensive unit tests to validate the propagation for various finish reasons ('length', 'stop', 'tool_calls', 'content_filter'). All existing and new tests pass, indicating a successful and robust implementation.

Highlights

  • Intent: This pull request fixes issue #3109 (The LiteLLM wrapper does not propagate finish_reason from LiteLLM responses) by ensuring that the finish_reason field from LiteLLM responses is correctly propagated to LlmResponse objects. This enables after_model_callback functions to accurately detect various completion conditions, such as max_tokens truncation, natural completion, tool invocations, or content filtering.
  • Changes: The changes involve modifying src/google/adk/models/lite_llm.py to extract and set the finish_reason from LiteLLM responses into the LlmResponse object. src/google/adk/telemetry/tracing.py was updated to gracefully handle finish_reason values that can be either enum (from Gemini) or string (from LiteLLM) by using a hasattr() check. Additionally, tests/unittests/models/test_litellm.py received four new unit tests to verify the proper propagation of finish_reason for 'length', 'stop', 'tool_calls', and 'content_filter' scenarios.
  • Reviewer Activity: No specific reviewer activity (comments, approvals, etc.) was provided in the context.

@adk-bot added the bot triaged and models ([Component] Issues related to model support) labels on Oct 8, 2025

@gemini-code-assist bot left a comment

Code Review

This is a great pull request that addresses an important issue with propagating the finish_reason. The changes in lite_llm.py and the new tests are well-targeted. I have a few suggestions to improve robustness and maintainability, particularly in tracing.py and the new tests.

- Use .name instead of .value for enum finish_reason (more robust for IntEnum)
- Extract first choice using walrus operator for better readability
- Consolidate tests using @pytest.mark.parametrize to reduce duplication (a sketch follows below)
- Strengthen test assertions to verify response content

All 53 tests pass.
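
One possible shape for the parametrized consolidation suggested above; the stub response and the helper are simplified stand-ins, not the real fixtures from test_litellm.py:

```python
# Illustrative only; the real tests build full LiteLLM ModelResponse objects.
from types import SimpleNamespace

import pytest

def _propagate(model_response, llm_response):
    # Same idea as the extraction sketch earlier in this thread.
    if model_response.choices:
        reason = model_response.choices[0].finish_reason
        if reason:
            llm_response.finish_reason = reason

@pytest.mark.parametrize(
    "finish_reason", ["length", "stop", "tool_calls", "content_filter"]
)
def test_finish_reason_is_propagated(finish_reason):
    model_response = SimpleNamespace(
        choices=[SimpleNamespace(finish_reason=finish_reason)]
    )
    llm_response = SimpleNamespace(finish_reason=None, content="ok")
    _propagate(model_response, llm_response)
    assert llm_response.finish_reason == finish_reason
    assert llm_response.content == "ok"  # response content stays intact
```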

@aperepel (Author) commented Oct 8, 2025

/gemini review

@gemini-code-assist bot left a comment

Code Review

Thank you for this contribution. This pull request effectively addresses the propagation of finish_reason from LiteLLM, which is a valuable enhancement for observing model completion states. I've identified a critical issue and a couple of areas for improvement to enhance the robustness and correctness of the code. Please see my detailed comments below.

aperepel and others added 3 commits October 8, 2025 10:51
Addressing review comments

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Addressing review comments

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Address type safety issue where finish_reason can be either:
- types.FinishReason enum (from Gemini responses)
- str (from LiteLLM responses)

Updated LlmResponse.finish_reason type hint to:
Optional[Union[types.FinishReason, str]]

This ensures type checkers correctly validate the dual nature of this
field across different model providers.

All 53 tests pass.

@aperepel (Author) commented Oct 8, 2025

/gemini review

@gemini-code-assist bot left a comment

Code Review

This is a great pull request that addresses an important issue with propagating the finish_reason. The changes are well-implemented and the addition of comprehensive unit tests is excellent. I have one suggestion regarding the data model for LlmResponse to improve long-term maintainability.

- Map finish_reason strings to proper FinishReason enum values in lite_llm.py (see the sketch after this list)
  - 'length' -> FinishReason.MAX_TOKENS
  - 'stop' -> FinishReason.STOP
  - 'tool_calls'/'function_call' -> FinishReason.STOP
  - 'content_filter' -> FinishReason.SAFETY
  - unknown values -> FinishReason.OTHER

- Add clarifying comment in tracing.py for string fallback path

- Update test_litellm.py to verify enum mapping:
  - Assert finish_reason is FinishReason enum instance
  - Verify correct enum values for each finish_reason string
  - Add test for unknown finish_reason mapping to OTHER

Benefits:
- Type consistency with Gemini native responses
- Avoids runtime warnings from string finish_reason
- Enables proper instanceof checks in callbacks
- Better integration with ADK telemetry
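
The proposed mapping could look roughly like the sketch below; it assumes google.genai's types.FinishReason enum (STOP, MAX_TOKENS, SAFETY, OTHER) and is not the exact lite_llm.py code:

```python
from google.genai import types

# Sketch of the string -> enum mapping suggested above.
_FINISH_REASON_MAPPING = {
    "length": types.FinishReason.MAX_TOKENS,
    "stop": types.FinishReason.STOP,
    # No dedicated tool-call member exists, so tool calls collapse to STOP.
    "tool_calls": types.FinishReason.STOP,
    "function_call": types.FinishReason.STOP,
    "content_filter": types.FinishReason.SAFETY,
}

def map_finish_reason(raw_reason: str) -> types.FinishReason:
    # Unknown or provider-specific strings fall back to OTHER.
    return _FINISH_REASON_MAPPING.get(raw_reason, types.FinishReason.OTHER)
```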

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request effectively propagates the finish_reason from LiteLLM responses by updating the LlmResponse model, mapping string reasons to the FinishReason enum, and enhancing telemetry tracing to handle both types. The changes are well-supported by comprehensive new unit tests. I have a couple of suggestions to improve code clarity and documentation accuracy.

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request does a good job of propagating the finish_reason from LiteLLM responses, which is crucial for observing and handling different model completion scenarios. The changes are well-structured and include comprehensive tests. I've included two suggestions: one to use a more specific enum value for tool calls to avoid information loss, and another to refactor the new tests for improved maintainability.

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request does a great job of propagating the finish_reason from LiteLLM responses and mapping them to the standard FinishReason enum. This is a valuable fix that improves observability and allows for better handling of different completion scenarios. The changes are well-implemented, and the addition of comprehensive unit tests ensures the new logic is robust. I have a couple of suggestions to further improve code clarity and maintainability.

Maps LiteLLM finish_reason string values to proper FinishReason enum
for type consistency with Gemini native responses.

Changes:
- Add _FINISH_REASON_MAPPING dictionary for string->enum conversion
  - "length" -> FinishReason.MAX_TOKENS
  - "stop" -> FinishReason.STOP
  - "tool_calls"/"function_call" -> FinishReason.STOP
  - "content_filter" -> FinishReason.SAFETY
  - Unknown values -> FinishReason.OTHER (fallback)

- Update finish_reason type hint to Optional[FinishReason] (no Union needed)

- Update telemetry tracing to use .name for enum serialization

- Add explanatory comments:
  - Why tool_calls maps to STOP (no TOOL_CALL enum exists)
  - Docstring clarifies mapping applies to all model providers

Tests:
- test_finish_reason_propagation: verifies enum mapping for all values
- test_finish_reason_unknown_maps_to_other: verifies fallback behavior

Benefits:
- Type consistency: finish_reason is always FinishReason enum
- No runtime warnings from mixed types
- Enables proper isinstance() checks in callbacks
- Dictionary mapping improves maintainability
- Better integration with ADK telemetry
@aperepel force-pushed the fix/litellm-finish-reason-3109 branch from b2e2d7b to beadc63 on October 15, 2025 03:41

@aperepel (Author)

The bot seems to be reviewing only the last commit, not the whole PR. Squashed changes into one now.

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request effectively addresses the issue of propagating finish_reason from LiteLLM responses by introducing a mapping to the standard FinishReason enum. The changes are well-implemented across the model, telemetry, and response layers, and are supported by comprehensive new unit tests. My feedback includes a couple of suggestions to improve code maintainability and simplify logic by removing now-redundant defensive code.

- Simplify tracing.py by removing isinstance check (always enum now)
- Refactor test assertions to use dictionary mapping instead of if/elif
- Reduce code duplication and improve readability

Addresses Gemini Code Assist bot suggestions:
- tracing.py: Direct .name access since finish_reason is always enum
- test_litellm.py: Dictionary mapping for cleaner test assertions

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request correctly propagates the finish_reason from LiteLLM responses, ensuring it's available as a types.FinishReason enum in LlmResponse objects. The changes are well-implemented across lite_llm.py and tracing.py, and the updated docstring in llm_response.py adds clarity. The new unit tests in test_litellm.py are comprehensive and cover various scenarios, including a fallback for unknown reasons. I've added one suggestion to improve the maintainability of the new tests by reducing code duplication.

Import and use the actual _FINISH_REASON_MAPPING from lite_llm instead of
duplicating it in tests. This ensures tests stay in sync with implementation
changes automatically (a sketch follows this commit message).

Benefits:
- Single source of truth for finish_reason mappings
- Tests automatically reflect any future mapping changes
- Reduced code duplication
- Better maintainability

Addresses review comment:
google#3114 (review)
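
A sketch of the single-source-of-truth idea from this commit; the import path follows the module named in the PR description, and the tiny conversion helper is a stand-in rather than the PR's exact test code:

```python
from types import SimpleNamespace

import pytest
from google.genai import types

# Assumes the mapping is importable from the module changed in this PR.
from google.adk.models.lite_llm import _FINISH_REASON_MAPPING

def _convert(model_response):
    # Stand-in for _model_response_to_generate_content_response(): pull the raw
    # string and map it to the FinishReason enum, defaulting to OTHER.
    raw = model_response.choices[0].finish_reason
    return _FINISH_REASON_MAPPING.get(raw, types.FinishReason.OTHER)

@pytest.mark.parametrize(
    "raw_reason, expected", sorted(_FINISH_REASON_MAPPING.items())
)
def test_known_reasons_map_to_enum(raw_reason, expected):
    response = SimpleNamespace(choices=[SimpleNamespace(finish_reason=raw_reason)])
    assert _convert(response) is expected

def test_unknown_reason_falls_back_to_other():
    response = SimpleNamespace(
        choices=[SimpleNamespace(finish_reason="brand_new_reason")]
    )
    assert _convert(response) is types.FinishReason.OTHER
```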

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request effectively addresses the issue of propagating finish_reason from LiteLLM responses. The introduction of a mapping to FinishReason enums standardizes the field, which is a solid approach for consistency across different models. The changes are well-tested with new unit tests covering various scenarios. The fix in the telemetry code to correctly handle the enum is also a valuable improvement. Overall, this is a well-executed change that enhances the framework's robustness.

The Union type is no longer needed since finish_reason is always
a FinishReason enum (never a string after our mapping).

Addresses review comment:
google#3114 (comment)

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request effectively propagates the finish_reason from LiteLLM responses, ensuring consistency by mapping string values to the FinishReason enum. The changes are well-implemented across the model, telemetry, and response modules. The addition of comprehensive unit tests is excellent and covers various scenarios, including the fallback for unknown reasons. I've identified one potential issue regarding list handling that could lead to an IndexError.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

@aperepel (Author)

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request effectively addresses the propagation of finish_reason from LiteLLM responses by mapping them to the standard types.FinishReason enum. The implementation is robust, handling various completion scenarios and unknown reasons gracefully, which ensures type consistency across different model providers. The corresponding fix in the telemetry code to correctly process the enum value is a crucial improvement. The addition of comprehensive unit tests is excellent, providing strong validation for the new logic. Overall, this is a high-quality change that enhances the framework's reliability and observability.
