chore: Added default string to HuggingfacePostTrainingConfig #3109

cheesecake100201 · 2025-08-12T13:22:15Z

What does this PR do?

Adds a default value for dpo_output_dir to HuggingFacePostTrainingConfig. Without a default, the llama stack server failed to start with older builds (I have not tried newer ones; I tested with a starter-run.yaml that I had created earlier).

Error Received
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/server/server.py", line 625, in <module>
    main()
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/server/server.py", line 438, in main
    impls = loop.run_until_complete(construct_stack(config))
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/asyncio/base_events.py", line 687, in run_until_complete
    return future.result()
           ^^^^^^^^^^^^^^^
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/stack.py", line 322, in construct_stack
    impls = await resolve_impls(
            ^^^^^^^^^^^^^^^^^^^^
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/resolver.py", line 168, in resolve_impls
    return await instantiate_providers(sorted_providers, router_apis, dist_registry, run_config, policy)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/resolver.py", line 294, in instantiate_providers
    impl = await instantiate_provider(provider, deps, inner_impls, dist_registry, run_config, policy)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/llama_stack/core/resolver.py", line 375, in instantiate_provider
    config = config_type(**provider.config)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/sarthakdeshpande/Desktop/open source/llama-stack/.venv/lib/python3.12/site-packages/pydantic/main.py", line 214, in __init__
    validated_self = self.__pydantic_validator__.validate_python(data, self_instance=self)
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
pydantic_core._pydantic_core.ValidationError: 1 validation error for HuggingFacePostTrainingConfig
dpo_output_dir
  Field required [type=missing, input_value={'checkpoint_format': 'hu...: None, 'device': 'cpu'}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.10/v/missing
++ error_handler 128
++ echo 'Error occurred in script at line: 128'
Error occurred in script at line: 128
++ exit 1
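
The failure above can be reproduced in isolation. The following is a minimal sketch, assuming Pydantic v2 (as the traceback indicates); RequiredFieldConfig is a hypothetical, trimmed-down stand-in for the real config class and keeps only two fields. A field declared without a default is required, so a provider config written before the field existed fails validation.

```python
from pydantic import BaseModel, ValidationError


class RequiredFieldConfig(BaseModel):
    """Hypothetical stand-in: dpo_output_dir has no default, so it is required."""

    device: str = "cpu"
    dpo_output_dir: str


# A provider config written before dpo_output_dir existed.
legacy_provider_config = {"device": "cpu"}

try:
    RequiredFieldConfig(**legacy_provider_config)
    error_summary = []
except ValidationError as exc:
    # Pydantic v2 reports the field location and an error type per failure.
    error_summary = [(err["loc"], err["type"]) for err in exc.errors()]

# The same config validates once the key is supplied.
fixed = RequiredFieldConfig(**legacy_provider_config, dpo_output_dir="./dpo_output")
```

Here error_summary ends up holding a single entry for dpo_output_dir with the error type missing, which matches the "Field required [type=missing ...]" line in the traceback.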

cdoern · 2025-08-12T14:04:42Z

llama_stack/providers/inline/post_training/huggingface/config.py

@@ -71,7 +71,7 @@ class HuggingFacePostTrainingConfig(BaseModel):
    dpo_beta: float = 0.1
    use_reference_model: bool = True
    dpo_loss_type: Literal["sigmoid", "hinge", "ipo", "kto_pair"] = "sigmoid"
-    dpo_output_dir: str
+    dpo_output_dir: str = ""


Suggested change

-    dpo_output_dir: str = ""
+    dpo_output_dir: str | None = None

I think None is a more suitable default than an empty string. Some checks for None in the code would be great too, thanks!
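
One way to see the difference between the two defaults is the sketch below. It assumes the value is eventually turned into a filesystem path with pathlib, which the thread does not confirm; the variable names are illustrative only.

```python
from pathlib import Path

# An empty string is still a usable path: pathlib normalises it to the
# current directory, so a forgotten setting would not stand out.
empty_as_path = Path("")
points_at_cwd = empty_as_path == Path(".")

# None has no path meaning, so "not configured" can be tested explicitly.
unset_value = None
is_configured = unset_value is not None
```

With an empty-string default, output could silently land in the working directory; with None, the calling code can detect the missing setting and report it.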

cheesecake100201 · Aug 12, 2025

Makes sense, I will make that change.

ashwinb · 2025-08-12T15:52:07Z

llama_stack/providers/inline/post_training/huggingface/config.py

@@ -71,7 +71,7 @@ class HuggingFacePostTrainingConfig(BaseModel):
    dpo_beta: float = 0.1
    use_reference_model: bool = True
    dpo_loss_type: Literal["sigmoid", "hinge", "ipo", "kto_pair"] = "sigmoid"
-    dpo_output_dir: str
+    dpo_output_dir: str | None = None


I see. Yes, this is a backwards-incompatible change, as you noted. We should make that a runtime check where the provider yells if this value is still None at runtime, as @cdoern notes.
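
The runtime check described here could look roughly like the sketch below. It is an illustration under assumptions, not the code merged in this PR: ConfigStandIn is a plain dataclass standing in for HuggingFacePostTrainingConfig so the snippet has no third-party dependencies, and require_dpo_output_dir is a hypothetical helper name.

```python
from __future__ import annotations

from dataclasses import dataclass
from pathlib import Path


@dataclass
class ConfigStandIn:
    """Hypothetical stand-in for HuggingFacePostTrainingConfig."""

    dpo_output_dir: str | None = None


def require_dpo_output_dir(config: ConfigStandIn) -> Path:
    """Return the DPO output directory, or fail loudly if it was never set.

    The check runs when DPO training needs the path, not at server
    startup, so older run configs still load.
    """
    if config.dpo_output_dir is None:
        raise ValueError(
            "dpo_output_dir is not set; add it to the provider config "
            "before running DPO training"
        )
    return Path(config.dpo_output_dir)


# A config that sets the field yields a usable path.
configured = require_dpo_output_dir(ConfigStandIn(dpo_output_dir="./checkpoints/dpo"))

# A legacy config loads fine but is rejected once the path is needed.
try:
    require_dpo_output_dir(ConfigStandIn())
    legacy_error = None
except ValueError as exc:
    legacy_error = str(exc)
```

Raising at the point of use keeps server startup working for existing configs while still giving a clear message instead of a failure deeper in the training code.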

cheesecake100201 · Aug 12, 2025

So do we need to change anything else here, or is this good?

cheesecake100201 · Aug 12, 2025

Currently it is throwing a stack trace error. Do we need to show this as a log error only, or something else?

cdoern · Aug 18, 2025

You'll need to add checks in the code that uses dpo_output_dir. This is the source of the errors you are seeing. Also, please run pre-commit to regenerate the OpenAPI schema.

cheesecake100201 · Aug 19, 2025

I could find only one place where dpo_output_dir is used and have added a check there. Please review again.

cheesecake100201 · Aug 20, 2025

@cdoern I am unable to run pre-commit because of dependency issues that appeared after I upgraded my laptop; they are caused solely by the change in architecture.

cheesecake100201 · 2025-08-25T06:29:10Z

@cdoern Please review

franciscojavierarceo

Please fix precommit. I have faith in you to resolve your upgrade issues. 😎

chore: Added default string to HuggingfacePostTrainingConfig

db6693e

cheesecake100201 requested review from ashwinb, yanxi0830, hardikjshah, raghotham, ehhuang, terrytangyuan, leseb, bbrowning, reluctantfuturist, mattf and slekkala1 as code owners August 12, 2025 13:22

meta-cla bot added the CLA Signed label Aug 12, 2025

cdoern suggested changes Aug 12, 2025

chore:Updated empty string to None

7713142

ashwinb reviewed Aug 12, 2025

cheesecake100201 and others added 2 commits August 12, 2025 21:52

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

8768432

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

6520a6d

cheesecake100201 force-pushed the HuggingfacePostTrainingConfig-branch branch from bfc4379 to 6520a6d August 14, 2025 07:07

cheesecake100201 and others added 2 commits August 19, 2025 22:39

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

5ce2f00

chore:Added a check for if dpo_output_dir exists in config

75bdc7b

cheesecake100201 added 3 commits August 25, 2025 11:59

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

d0d7376

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

0390054

Merge branch 'main' into HuggingfacePostTrainingConfig-branch

66f4af7

franciscojavierarceo requested changes Aug 29, 2025
