Pull requests: ggml-org/llama.cpp

llama : minor coding style fix for smollm3
#14605 opened Jul 9, 2025 by ngxson
cuda : support Falcon-H1 state size for SSM_SCAN
Labels: ggml, Nvidia GPU, testing
#14602 opened Jul 9, 2025 by compilade
kv-cache : opt mask set input
#14600 opened Jul 9, 2025 by ggerganov
Docs: script to auto-generate ggml operations docs
Labels: devops, documentation, python, script, testing
#14598 opened Jul 9, 2025 by am17an
Smoldocling support
Labels: python
#14597 opened Jul 9, 2025 by ryan-mangeno
metal : fuse add
Labels: Apple Metal, ggml
#14596 opened Jul 9, 2025 by ggerganov (Draft)
docker : add cann build pipline
Labels: devops
#14591 opened Jul 9, 2025 by diannaojiang
vulkan: support SET_ROWS
Labels: ggml, Vulkan
#14587 opened Jul 9, 2025 by jeffbolznv
metal : reuse graphs
Labels: Apple Metal, demo, ggml
#14570 opened Jul 7, 2025 by ggerganov (Draft)
SYCL: Initial set_rows kernel implementation
Labels: ggml, SYCL
#14562 opened Jul 7, 2025 by qnixsynapse
model : add PLaMo-2 model
Labels: examples, python
#14560 opened Jul 7, 2025 by mitmul
vulkan: optimizations for deepseek prompt processing
Labels: ggml, Vulkan
#14555 opened Jul 6, 2025 by jeffbolznv
CUDA: add set rows for f32 and f16
Labels: examples, ggml, Nvidia GPU
#14551 opened Jul 6, 2025 by am17an
opencl: add set_rows for f16 and f32
Labels: ggml, OpenCL
#14547 opened Jul 6, 2025 by lhez
OpenCL: add tiled mul_mat_f16_f32
Labels: ggml, OpenCL
#14535 opened Jul 4, 2025 by rmatif
ggml: fix typo in ggml.c
Labels: ggml
#14531 opened Jul 4, 2025 by zhouwg
ggml: Add initial WebGPU backend
Labels: devops, documentation, ggml, python
#14521 opened Jul 3, 2025 by reeselevine
MUSA: upgrade musa sdk to <<TBD>>
Labels: ggml, Nvidia GPU
#14498 opened Jul 2, 2025 by yeahdongcn (Draft)
Allow truncation when embedding
Labels: examples, server
#14493 opened Jul 2, 2025 by huydt84