
belericant

What does this PR do?

This PR adds the feature requested in #30758. The HHCache class is taken almost directly from the original H2O paper authors' code, found here. Currently the PR only adds the changes required for the Llama model class. As of now I have taken @gante's suggestion of adding Cache.post_process() and calling it within LlamaAttention.forward.
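To make the post_process() idea concrete, here is a minimal, hypothetical sketch of an H2O-style cache: the attention layer hands its attention weights back to the cache after each forward pass, the cache accumulates per-token attention mass, and once the budget (heavy hitters + recent window) is exceeded it evicts the weakest entry outside the recent window. `SimpleHHCache` and its fields are illustrative stand-ins, not the PR's actual classes or tensors.

```python
class SimpleHHCache:
    """Toy heavy-hitter cache: keeps `num_hh` high-score entries plus the
    `num_recent` most recent ones, evicting the rest as decoding proceeds."""

    def __init__(self, num_hh, num_recent):
        self.num_hh = num_hh
        self.num_recent = num_recent
        self.keys = []    # cached entries (stand-ins for key/value tensors)
        self.scores = []  # cumulative attention mass received per entry

    def update(self, key):
        # called when a new token's key/value is appended to the cache
        self.keys.append(key)
        self.scores.append(0.0)

    def post_process(self, attn_weights):
        # called from the attention layer after each forward pass, with one
        # attention weight per cached entry for the current query token
        for i, w in enumerate(attn_weights):
            self.scores[i] += w
        budget = self.num_hh + self.num_recent
        if len(self.keys) > budget:
            # evict the lowest-scoring entry outside the recent window
            window_start = len(self.keys) - self.num_recent
            candidates = self.scores[:window_start]
            drop = candidates.index(min(candidates))
            del self.keys[drop]
            del self.scores[drop]
```

For example, with a budget of one heavy hitter plus a two-token recent window, the fourth decoding step triggers eviction of whichever older token has accumulated the least attention.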

To-Do

  1. I'm not sure the logic for RoPE re-rotation is 100% correct. I think the recent tokens are handled correctly, but not the heavy-hitter tokens after eviction. Another set of eyes on that would be appreciated.
  2. Write tests to ensure that this HHCache class matches the behavior of the paper authors' original code.
  3. Benchmarking(?)

Feedback and/or help would be appreciated. Thanks!
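On the re-rotation question: since RoPE rotates each key by an angle proportional to its position, a key cached at position `p_old` that ends up at position `p_new` after eviction should be rotated by the angle for `(p_new - p_old)` so it behaves as if it had been encoded at the new position. A minimal sketch of that identity, representing each 2-D RoPE pair as a complex number (the function names and single-frequency setup are illustrative, not the PR's implementation):

```python
import math

def rope(pair, pos, theta):
    # apply a single-frequency rotary embedding at position `pos`
    return pair * complex(math.cos(pos * theta), math.sin(pos * theta))

def rerotate(rotated_pair, old_pos, new_pos, theta):
    # undo the old rotation and apply the new one in a single step:
    # rotating by (new_pos - old_pos) is equivalent to rotating the raw
    # key at new_pos, because the rotations compose additively
    delta = new_pos - old_pos
    return rotated_pair * complex(math.cos(delta * theta), math.sin(delta * theta))
```

So after eviction compacts the cache, each surviving key would be multiplied by the rotation for its position shift; a test could check that `rerotate(rope(k, p_old, theta), p_old, p_new, theta)` equals `rope(k, p_new, theta)`.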

@amyeroberts
Collaborator

cc @gante @ArthurZucker

Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot closed this Aug 3, 2024
@ArthurZucker ArthurZucker reopened this Aug 5, 2024
@ArthurZucker
Collaborator

Re-opened as we were waiting on the to-dos. @belericant, should I close it?
