Skip to content

Releases: ggml-org/llama.cpp

b7050

13 Nov 18:46
dd091e5

Choose a tag to compare

sched : fix reserve ignoring user tensor assignments (#17232)

b7049

13 Nov 17:50
1215dde

Choose a tag to compare

ggml-cpu : add RISC-V vector intrinsic support for silu and cvar oper…

b7048

13 Nov 17:34
0cfb191

Choose a tag to compare

metal: accelerated conv2d (#17175)

* metal: accelerated conv2d

* cont : cleanup

---------

Co-authored-by: bghira <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>

b7047

13 Nov 16:11
2776db6

Choose a tag to compare

Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233)

This reverts commit 1c398dc9eca9c366ce98deb0e6f3538e444ebc8a.

b7046

13 Nov 10:17
879dec3

Choose a tag to compare

ggml-cpu : use template for argsort (#17222)

b7045

13 Nov 02:30
97d5117

Choose a tag to compare

CANN: Add cross_entropy_loss op support (#16886)

* update L2_NORM op support

* update L2_NORM op support

* remove extra whitespace

* cann: update cross_entropy_loss op support

* remove trailing whitespaces

* rebase the latest code in the main repository and remove the l2_norm operator that already exists in another pull request.

* undo the l2_norm operator deletion

b7044

13 Nov 01:53
a90eb94

Choose a tag to compare

CUDA: fuse rope + set_rows (#16884)

* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem

b7042

13 Nov 01:35
ffb6f3d

Choose a tag to compare

vocab : correct bounds check for UGM XCDA array access (#17215)

b7041

13 Nov 01:15
5d6838b

Choose a tag to compare

CUDA: static assert to prevent misuse of memcpy_1 (#17198)

b7039

12 Nov 20:13
374fe09

Choose a tag to compare

ggml : use std::sort in ggml_argsort CPU implementation (#17211)

* ggml : use std::sort in ggml_argsort CPU implementation

* cont : add missing header