[OpenVINO backend] supporting inference for Gemma, Mistral and GPT2 with ov backend #2310

Mohamed-Ashraf273 · 2025-06-22T15:58:03Z

Description of the change

As part of my GSoC 2025 project to support inference with the OpenVINO backend, this PR adds inference support for the Gemma, Mistral, and GPT-2 pipelines.

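import os

# Select the backend before keras_hub (and therefore keras) is imported.
os.environ["KERAS_BACKEND"] = "openvino"
import keras_hub

# Load GPT-2 Large with float16 weights.
model = keras_hub.models.GPT2CausalLM.from_preset(
    "gpt2_large_en", dtype="float16"
)
model.summary()

# Generate a continuation, capped at 20 tokens including the prompt.
output = model.generate("Keras is ", max_length=20)
print("Generated text:", output)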
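
The same pattern should carry over to the other two model families named in the title. The following is a minimal sketch, not code from this PR: the `CAUSAL_LMS` table and the `generate` helper are hypothetical names introduced for illustration, and the `gemma_2b_en` and `mistral_7b_en` presets are assumed to be available to you (Gemma weights may require accepting a license before download). Whether each model actually runs on the OpenVINO backend depends on the changes in this PR.

```python
import os

# The backend must be chosen before keras / keras_hub are imported.
os.environ["KERAS_BACKEND"] = "openvino"

# Hypothetical lookup table: (keras_hub.models class name, preset name).
# The Gemma and Mistral presets are assumptions; swap in the ones you use.
CAUSAL_LMS = {
    "gpt2": ("GPT2CausalLM", "gpt2_large_en"),
    "gemma": ("GemmaCausalLM", "gemma_2b_en"),
    "mistral": ("MistralCausalLM", "mistral_7b_en"),
}


def generate(family, prompt, max_length=20, dtype="float16"):
    """Load the causal LM for `family` and generate a continuation of `prompt`."""
    # Imported lazily so the KERAS_BACKEND variable above is already in effect.
    import keras_hub

    class_name, preset = CAUSAL_LMS[family]
    model_cls = getattr(keras_hub.models, class_name)
    model = model_cls.from_preset(preset, dtype=dtype)
    return model.generate(prompt, max_length=max_length)


# Example call (downloads the preset weights on first use):
# print("Generated text:", generate("gemma", "Keras is "))
```

Keeping the `keras_hub` import inside the helper is a deliberate choice here: it ensures the environment variable is set before Keras picks its backend, regardless of import order elsewhere in the script.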

Reference

https://docs.openvino.ai/2025/index.html
https://keras.io/api/
https://keras.io/keras_hub/

Colab Notebook

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and works with all backends (TensorFlow, JAX, and PyTorch).
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have followed the Keras Hub Model contribution guidelines in making these changes.
I have followed the Keras Hub API design guidelines in making these changes.
I have signed the Contributor License Agreement.

Mohamed-Ashraf273 · 2025-07-14T16:23:54Z

@fchollet
@mattdangerw
@rkazants
@divyashreepathihalli

Mohamed-Ashraf273 · 2025-07-21T19:09:38Z

@mattdangerw
@divyashreepathihalli

github-actions bot added the Gemma (Gemma model specific issues) label Jun 22, 2025

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch 26 times, most recently from 6576b03 to 074f0c2 Compare June 23, 2025 17:26

[OpenVINO backend] supporting inference for gemma with ov backend

692ae90

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch 2 times, most recently from d748dd5 to f5470cd Compare June 24, 2025 13:36

exclude hgnetv2 model tests

3d6a09b

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from c36b8e5 to 3d6a09b Compare July 15, 2025 20:44

Mohamed-Ashraf273 requested a review from divyashreepathihalli July 16, 2025 10:52

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from cbd322e to e59d313 Compare July 16, 2025 11:58

Merge branch 'master' into supporting_gemma_inference_with_ov_backend

41f9a0f

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from e59d313 to 41f9a0f Compare July 16, 2025 12:01

Mohamed-Ashraf273 added 2 commits July 16, 2025 15:10

disable export test

407c700

add openvino_utils_test

cb6f1e9

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from 8a51525 to cb6f1e9 Compare July 18, 2025 11:27

Mohamed-Ashraf273 added 2 commits July 18, 2025 14:35

Merge branch 'master' into supporting_gemma_inference_with_ov_backend

cc81429

disable dinov2 tests for openvino backend

76845af

Mohamed-Ashraf273 mentioned this pull request Jul 18, 2025

[Performance] High Memory Usage During GPT-2 Generation Using OpenVINO Backend on Keras 3 Compared to other backends openvinotoolkit/openvino#31390

Open

3 tasks

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from 381ac68 to 6390270 Compare July 19, 2025 13:58

remove model reusing in openvino backend

b798a4f

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from 6390270 to b798a4f Compare July 20, 2025 11:53

Mohamed-Ashraf273 marked this pull request as draft July 20, 2025 12:45

Mohamed-Ashraf273 mentioned this pull request Jul 20, 2025

Simulated OpenVINO Backend for Testing Unmerged PR Features with Memory Profiling keras-team/keras#21491

Closed

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch 4 times, most recently from 9f91fdb to 5003c9e Compare July 21, 2025 11:26

making the model dynmaic for openvino backend

e6ef629

Mohamed-Ashraf273 force-pushed the supporting_gemma_inference_with_ov_backend branch from 5003c9e to e6ef629 Compare July 21, 2025 12:05

Mohamed-Ashraf273 marked this pull request as ready for review July 21, 2025 14:00

Mohamed-Ashraf273 mentioned this pull request Jul 22, 2025

Simulated OpenVINO Backend for Testing Unmerged PR Features with Memory Profiling keras-team/keras#21500

Draft

3 tasks

Mohamed-Ashraf273 added 3 commits July 22, 2025 22:44

remove reshaping ops at position_embedding.py

62e1cce

[OpenVIno backend] add max_length check

d98fc8b

test_cache problem solved

178aba2
