-
Notifications
You must be signed in to change notification settings - Fork 14k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
metal : attach residency sets to queue
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
#17766
opened Dec 4, 2025 by
ggerganov
Loading…
Add a search field on model selector / improve mobile display
examples
server
#17765
opened Dec 4, 2025 by
ServeurpersoCom
Loading…
ggml webgpu: unary op suppport, code refactoring, ops support
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#17764
opened Dec 4, 2025 by
reeselevine
Loading…
Fix too stringent check on CUDA "fast copy" (can_be_transposed) condition and extend with one more case
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17759
opened Dec 4, 2025 by
bssrdf
Loading…
CANN: Refactor issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
is_matched_graph for better maintainability
Ascend NPU
#17758
opened Dec 4, 2025 by
rauletorresc
Loading…
cann: refactor ACL graph cache
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17752
opened Dec 4, 2025 by
wangweixuan
Loading…
Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#17748
opened Dec 4, 2025 by
max-krasnyansky
Loading…
CUDA: fix FA VKQ accumulator overflow
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17746
opened Dec 3, 2025 by
JohannesGaessler
Loading…
Remove deprecated trigger_words support from grammar sampler
#17745
opened Dec 3, 2025 by
yifant-code
•
Draft
model: add llama 4 scaling for mistral-large (deepseek arch)
model
Model specific
#17744
opened Dec 3, 2025 by
ngxson
Loading…
CANN: implement the SSM_CONV operator
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#17737
opened Dec 3, 2025 by
0Marble
Loading…
convert: support Mistral 3 Large MoE
python
python script changes
#17730
opened Dec 3, 2025 by
ngxson
Loading…
CANN: In the ROPE operator, yarn_ramp uses cache
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17725
opened Dec 3, 2025 by
TianHao324
Loading…
common : add parser for ministral/mistral large 3
documentation
Improvements or additions to documentation
examples
server
testing
Everything test related
#17713
opened Dec 3, 2025 by
aldehir
Loading…
vulkan: Use one row per workgroup for f32 mmv
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17711
opened Dec 3, 2025 by
jeffbolznv
Loading…
build: for GGML_BACKEND_DL, ggml need not depend on backend
ggml
changes relating to the ggml tensor library for machine learning
#17709
opened Dec 3, 2025 by
jeffbolznv
Loading…
common: Deepseek V3.2 tool call parser
testing
Everything test related
#17707
opened Dec 3, 2025 by
hksdpc255
Loading…
CANN: Support fusion operator that supports mul and add
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#17706
opened Dec 3, 2025 by
TianHao324
•
Draft
cuda: optimize SOLVE_TRI using registers and FMAF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17703
opened Dec 2, 2025 by
wsbagnsv1
Loading…
vulkan: add more num_blocks instantiations in rms_norm
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17701
opened Dec 2, 2025 by
jeffbolznv
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.