Releases · ggml-org/llama.cpp

13 Nov 18:46

dd091e5

b7050

sched : fix reserve ignoring user tensor assignments (#17232)

Assets 16

13 Nov 17:50

github-actions

b7049

1215dde

ggml-cpu : add RISC-V vector intrinsic support for silu and cvar oper…

Assets 16

13 Nov 17:34

github-actions

b7048

0cfb191

metal: accelerated conv2d (#17175)

* metal: accelerated conv2d

* cont : cleanup

---------

Co-authored-by: bghira
Co-authored-by: Georgi Gerganov

Assets 16

13 Nov 16:11

github-actions

b7047

2776db6

Revert "ggml-cpu: handle 3d tensors in repack mat_mul (#17030)" (#17233)

This reverts commit 1c398dc9eca9c366ce98deb0e6f3538e444ebc8a.

Assets 16

13 Nov 10:17

github-actions

b7046

879dec3

ggml-cpu : use template for argsort (#17222)

Assets 16

13 Nov 02:30

github-actions

b7045

97d5117

CANN: Add cross_entropy_loss op support (#16886)

* update L2_NORM op support

* update L2_NORM op support

* remove extra whitespace

* cann: update cross_entropy_loss op support

* remove trailing whitespaces

* rebase the latest code in the main repository and remove the l2_norm operator that already exists in another pull request.

* undo the l2_norm operator deletion

Assets 16

13 Nov 01:53

github-actions

b7044

a90eb94

CUDA: fuse rope + set_rows (#16884)

* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem

Assets 16

13 Nov 01:35

github-actions

b7042

ffb6f3d

vocab : correct bounds check for UGM XCDA array access (#17215)

Assets 16

13 Nov 01:15

github-actions

b7041

5d6838b

CUDA: static assert to prevent misuse of memcpy_1 (#17198)
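
For readers unfamiliar with the technique named in this entry, the following is a minimal host-side sketch of guarding a fixed-size copy helper with `static_assert`. The helper name `copy_fixed` and the multiple-of-4 size rule are assumptions made for illustration; they are not taken from the actual `memcpy_1` code or from #17198.

```cpp
#include <cstring>

// Hypothetical helper (not the real memcpy_1): copies a compile-time-known
// number of bytes. The size rule below is an assumed example constraint.
template <int nbytes>
void copy_fixed(void * dst, const void * src) {
    // Rejects unsupported sizes at compile time instead of failing at run time.
    static_assert(nbytes > 0 && nbytes % 4 == 0,
                  "nbytes must be a positive multiple of 4");
    std::memcpy(dst, src, nbytes);
}
```

With this guard, `copy_fixed<8>(dst, src)` compiles, while `copy_fixed<3>(dst, src)` is rejected by the compiler with the message above.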

Assets 16

12 Nov 20:13

github-actions

b7039

374fe09

ggml : use std::sort in ggml_argsort CPU implementation (#17211)

* ggml : use std::sort in ggml_argsort CPU implementation

* cont : add missing header
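
As a rough illustration of what an argsort built on `std::sort` looks like, here is a minimal sketch. The function name `argsort_row` and its signature are hypothetical and do not reproduce the ggml implementation; the sketch only shows the general idea of sorting an index array with a comparator over the values.

```cpp
#include <algorithm>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical helper, not part of ggml: returns the indices that would
// sort `values` in ascending or descending order.
std::vector<int32_t> argsort_row(const std::vector<float> & values, bool ascending) {
    std::vector<int32_t> idx(values.size());
    std::iota(idx.begin(), idx.end(), 0); // fill with 0, 1, 2, ...

    if (ascending) {
        std::sort(idx.begin(), idx.end(),
                  [&](int32_t a, int32_t b) { return values[a] < values[b]; });
    } else {
        std::sort(idx.begin(), idx.end(),
                  [&](int32_t a, int32_t b) { return values[a] > values[b]; });
    }
    return idx;
}
```

Note that `std::sort` is not a stable sort, so the relative order of indices for equal values is unspecified in this sketch; `std::stable_sort` would preserve it.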

Assets 16
