Safetensors conversion #2290
base: master
Conversation
Thanks for the PR, will take a look in a bit :)
Thanks! Just left some initial comments.
Let's add a unit test that calls this util and tries loading the result with transformers, to see if it works. It's OK to add transformers to our CI environment here: https://github.com/keras-team/keras-hub/blob/master/requirements-common.txt
import os

import torch
from safetensors.torch import save_file
does this work on all backends? or do we need to flip between versions depending on the backend? worth testing out
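One way to "flip between versions depending on the backend", sketched as a pure lookup so it can be tested without any framework installed. The backend strings match what `keras.config.backend()` returns; the mapping itself is an assumption to be verified:

```python
# Hypothetical sketch: resolve the right safetensors saver per Keras backend
# instead of importing safetensors.torch unconditionally.
def resolve_save_file(backend):
    """Return (module_path, function_name) for the matching saver."""
    savers = {
        "torch": ("safetensors.torch", "save_file"),
        "tensorflow": ("safetensors.tensorflow", "save_file"),
        "jax": ("safetensors.flax", "save_file"),
    }
    if backend not in savers:
        raise ValueError(f"Unsupported backend for safetensors export: {backend}")
    return savers[backend]
```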
Nice! Please address the changes from the earlier PR as well
keras_hub/src/utils/transformers/export_gemma_to_safetensors_test.py
Thanks, nice work!
    return hf_config


def export_to_hf(keras_model, path):
We should add the API export decorator here, similar to this: https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/models/bloom/bloom_backbone.py#L15-L16
Also, do you think we should refactor some of the common code across models to a separate file? We can then expose that as the API.
So, this is what the directory keras_hub/src/utils/transformers/convert_to_safetensor/ will look like:
- export.py: this will have the common code. We will expose this as the API. This will also check whether we support safetensors conversion for a given passed model yet.
- gemma.py: this will just have a way to create the weight dictionary for Gemma. Inside export.py, we will call the weight conversion function specific to the specified model.
Pinging @mattdangerw to confirm if we should do this now or at a later point.
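The proposed export.py/gemma.py split could be sketched roughly as below. The registry keys, module names, and `get_weight_map` helper are all hypothetical illustrations of the idea, not the agreed design:

```python
# Hypothetical export.py: common entry point that dispatches to a
# per-model converter module, and rejects unsupported models early.
MODEL_EXPORTERS = {
    # backbone class name -> converter module name (e.g. gemma.py)
    "GemmaBackbone": "gemma",
}


def export_to_hf(keras_model, path):
    backbone_name = type(keras_model.backbone).__name__
    if backbone_name not in MODEL_EXPORTERS:
        raise ValueError(
            f"Safetensors export is not yet supported for {backbone_name}."
        )
    # e.g. from . import gemma; weights = gemma.get_weight_map(keras_model)
    ...
```

Adding a new model then means adding one converter module and one registry entry.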
I think we could land this and do the API bit at a later point. Though I agree it's an important concern. I'm not sure if we want a method like model.save_to_preset() or a function like some_export(model). Any thoughts?
I think structuring the export logic with a utility function (export_to_hf) and model-specific mappings (gemma.py) will enhance scalability and maintainability. New models can be added by creating a new file, while existing tests only need an import update.
+1 to Abheesht's comment. We need an API instead of a script; for Gemma, we already have that:
https://github.com/keras-team/keras-hub/blob/master/tools/gemma/export_gemma_to_hf.py
Leaving comments since I don't see the changes we discussed last week.
@pytest.mark.large
def test_export_to_hf(self):
    # Load Keras model
    keras_model = GemmaCausalLM.from_preset("gemma_2b_en")
We discussed this last week. In order to make GPU tests work, we need to use a smaller, randomly initialised Gemma model so that we don't hit OOM.
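A sketch of the tiny-model approach: instead of the 2B preset, construct a randomly initialised backbone from small hyperparameters. The keyword names follow GemmaBackbone's constructor; the specific values below are arbitrary test-sized assumptions:

```python
# Hypothetical config for a tiny Gemma used only in tests; the numbers are
# deliberately small so the test fits on a CI GPU without OOM.
TINY_GEMMA_KWARGS = dict(
    vocabulary_size=256,
    num_layers=2,
    num_query_heads=4,
    num_key_value_heads=1,
    hidden_dim=32,
    intermediate_dim=64,
    head_dim=8,
)

# In the test, assuming GemmaBackbone is importable from keras_hub.models:
#   backbone = GemmaBackbone(**TINY_GEMMA_KWARGS)
```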
Args:
    keras_model: The Keras Gemma model (e.g., GemmaCausalLM) to convert.
    path: str. Path of the directory to which the safetensors file,
        config and tokenizer will be saved.
Indent this
from safetensors.flax import save_file as flax_save_file
from safetensors.tensorflow import save_file as tf_save_file
from safetensors.torch import save_file as torch_save_file
Discussed last week. We are supposed to import these conditionally; we don't want to import all of these in every case. If the backend is JAX, import the Flax one; if the backend is Torch, import the Torch one, etc. You can raise an ImportError if they are not present. Maybe something like this?
keras-hub/keras_hub/src/utils/transformers/safetensor_utils.py
Lines 9 to 12 in 25c9062
try:
    import safetensors
except ImportError:
    safetensors = None
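Putting the two suggestions together, a conditional import per backend might look like the sketch below. The function name and error message are illustrative assumptions, not the final code:

```python
# Hypothetical sketch: import only the saver for the active backend, and
# raise a clear ImportError when the needed package is missing.
def import_save_file(backend):
    try:
        if backend == "torch":
            from safetensors.torch import save_file
        elif backend == "tensorflow":
            from safetensors.tensorflow import save_file
        elif backend == "jax":
            from safetensors.flax import save_file
        else:
            raise ValueError(f"Unsupported backend: {backend}")
    except ImportError:
        raise ImportError(
            "Exporting to safetensors requires the `safetensors` package "
            f"with support for the `{backend}` backend."
        )
    return save_file
```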
Description of the change
Reference
Colab Notebook: https://colab.research.google.com/drive/1naqf0sO2J40skndWbVMeQismjL7MuEjd?usp=sharing
Checklist