
Releases: createthis/llama.cpp

b5939 (19 Jul 16:19, commit f0d4d17)
Documentation: Update build.md's Vulkan section (#14736)

* Documentation: Rewrote and updated the "Without docker" portion of the Vulkan backend build documentation.

* Documentation: Reorganize build.md's Vulkan section.

b5897 (15 Jul 01:23, commit bdca383)
sycl: Hotfix for non dnnl codepath (#14677)

b5833 (05 Jul 16:31, commit a0374a6)
vulkan: Handle updated FA dim2/3 definition (#14518)

* vulkan: Handle updated FA dim2/3 definition

Pack mask boolean and n_head_log2 into a single dword to keep the push
constant block under the 128B limit.

* handle null mask for gqa

* allow gqa with dim3>1
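
For reference, Vulkan only guarantees 128 bytes of push-constant storage, which is why the mask flag and n_head_log2 are folded into a single dword. A minimal host-side sketch of such packing follows; the exact bit layout (flag in the top bit, n_head_log2 in the low bits) is an assumption for illustration, not the layout used by the Vulkan FA shaders:

```cpp
#include <cstdint>
#include <cstdio>

// Hypothetical layout: bit 31 = "mask present" flag, bits 0..30 = n_head_log2.
// The actual ggml Vulkan push-constant layout may differ; this only shows how
// two fields can share one 32-bit word to stay under the 128 B limit.
static uint32_t pack_mask_n_head_log2(bool has_mask, uint32_t n_head_log2) {
    return (uint32_t(has_mask) << 31) | (n_head_log2 & 0x7FFFFFFFu);
}

static void unpack_mask_n_head_log2(uint32_t packed, bool & has_mask, uint32_t & n_head_log2) {
    has_mask    = (packed >> 31) != 0;
    n_head_log2 = packed & 0x7FFFFFFFu;
}

int main() {
    const uint32_t packed = pack_mask_n_head_log2(true, 5);

    bool     has_mask    = false;
    uint32_t n_head_log2 = 0;
    unpack_mask_n_head_log2(packed, has_mask, n_head_log2);

    printf("has_mask=%d n_head_log2=%u\n", has_mask, n_head_log2); // has_mask=1 n_head_log2=5
    return 0;
}
```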

b5672 (15 Jun 19:56, commit 30e5b01)
quantize : change int to unsigned int for KV overrides (#14197)

b5606 (09 Jun 03:35, commit 247e5c6)
cuda : fix buffer type check with integrated GPUs (#14069)

b5602 (07 Jun 03:00, commit 745aa53)
llama : deprecate llama_kv_self_ API (#14030)

* llama : deprecate llama_kv_self_ API

* llama : allow llama_memory_(nullptr)

* memory : add flag for optional data clear in llama_memory_clear
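
Callers of the deprecated llama_kv_self_* helpers are expected to move to the llama_memory_* interface instead. A minimal migration sketch, assuming the llama_get_memory() accessor and the llama_memory_clear(mem, data) signature referenced above; verify both against the llama.h shipped with this tag:

```cpp
#include "llama.h"

// Clear a context's memory (KV cache). A sketch only: the function names and
// the bool `data` flag are taken from this release note plus assumed
// accessors, so check llama.h for the exact API.
static void clear_context_memory(llama_context * ctx, bool clear_data) {
    llama_memory_t mem = llama_get_memory(ctx);
    if (mem == nullptr) {
        // A null memory is tolerated by this release (llama_memory_(nullptr)),
        // e.g. for contexts created without a KV cache; nothing to clear.
        return;
    }
    // clear_data = true  : drop metadata and the cached tensor data
    // clear_data = false : reset metadata only, leaving buffers in place
    llama_memory_clear(mem, clear_data);
}
```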