Skip to content

Releases: ggml-org/llama.cpp

b7139

24 Nov 04:27
923ae3c

Choose a tag to compare

hexagon: add support for ROPE_NEOX (#17458)

b7138

24 Nov 02:29
01ad35e

Choose a tag to compare

CANN: Define `cann_graph_update_required` before macro (#17434)

**Description of the problem**

`cann_graph_update_required` is redundantly defined and
initialized as `false` inside two mutually exclusive macro branches.

**Proposed solution**

Define it right before the macro so that it could serve both
branches.

b7137

24 Nov 01:26
fcb0138

Choose a tag to compare

ggml-hexagon: Initial Hexagon v68/v69 support  (#17394)

* ggml-hexagon: fix build error with GCC

Add stdexcept include to fix GCC build errors

Signed-off-by: Mohamed Mediouni <[email protected]>

* ggml-hexagon: check VTCM acquire failures

Signed-off-by: Mohamed Mediouni <[email protected]>

* ggml-hexagon: disable destination bypass on older than v73

v68 errors out if having bypass enabled when the VTCM is the destination.

At least on v68 this made things actually work... not a proper fix though, so to look at later...

Signed-off-by: Mohamed Mediouni <[email protected]>

* ggml-hexagon: add initial v68/v69 support

v68 is the Hexagon revision notably used on the Snapdragon 8cx
Gen 3 and the QCM6490.

Also add support for v69.

8MB isn't a supported page size, so relax asked for page size constraint
for HAP_compute_res_attr_set_vtcm_param_v2 to optimal.

Signed-off-by: Mohamed Mediouni <[email protected]>

---------

Signed-off-by: Mohamed Mediouni <[email protected]>

b7136

24 Nov 00:18
d5bc1ad

Choose a tag to compare

ggml-hexagon: add `hex_supported_buffer` for better buffer supported …

b7134

23 Nov 10:51
96ac5a2

Choose a tag to compare

cuda : support non-contiguous i32 to i32 copy (#17326)

* support non-contiguous i32 to i32 copy

* add tests

* rename cpy_flt to cpy_scalar and reindent params

b7132

23 Nov 07:03
54d83bb

Choose a tag to compare

vulkan: remove a couple unnecessary switches (#17419)

b7130

22 Nov 10:26
3f3a4fb

Choose a tag to compare

Revive MUL_MAT_ID to perf testing (#17397)

b7129

22 Nov 00:38
028f93e

Choose a tag to compare

HIP: RDNA4 tensor core support for MMF (#17077)

* mmf for rdna4

* align the padding for rdna4

* forbit mul_mat_f for rdna4

* fix as comment

* remove device kernels

* add constexpr for early return

* update based on review comment

* change based on the review comment

* pass compile error

* keep code consistency

---------

Co-authored-by: zhang hui <[email protected]>

b7128

21 Nov 23:16
8e9ddba

Choose a tag to compare

opencl: refine condition for kqv mm (#17392)