Pull requests: ggml-org/llama.cpp
Pull requests list
[SYCL] support OPs: conv_2d, conv_2d_dw, conv2d_transpose
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24600
opened Jun 14, 2026 by
arthw
Contributor
ci: fix vulkan docker images
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#24595
opened Jun 13, 2026 by
Kononnable
spec: support eagle3 for qwen3.5 & 3.6
examples
model
Model specific
server
#24593
opened Jun 13, 2026 by
ruixiang63
Contributor
hexagon: support for op-trace (fine-grain tracing of HVX/HMX/DMA events)
ggml
changes relating to the ggml tensor library for machine learning
Hexagon
python
python script changes
script
Script related
#24592
opened Jun 13, 2026 by
max-krasnyansky
Member
•
Draft
llama : suppress misleading Gemma4Assistant error during memory fitting
#24590
opened Jun 13, 2026 by
leotm
HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#24588
opened Jun 13, 2026 by
DEV-DUFORD
vulkan: add iq4_nl support back to FA
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#24585
opened Jun 13, 2026 by
jeffbolznv
Contributor
[SYCL] add to support pool_1d, move pool_1d/2d code to pool.cpp/hpp
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24584
opened Jun 13, 2026 by
arthw
Contributor
vulkan: support all backend tests for SQR/SQRT/SIN/COS/CLAMP/LEAKY_RELU/NORM
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
WebGPU
#24582
opened Jun 13, 2026 by
jeffbolznv
Contributor
vulkan: Support gated_delta_net with S_v=16
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#24581
opened Jun 13, 2026 by
jeffbolznv
Contributor
vulkan: support more CONCAT types
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#24579
opened Jun 13, 2026 by
jeffbolznv
Contributor
[SYCL]fix reorder function crash:GGML_ASSERT(block_num_y % num_subgroups ==0)
examples
ggml
changes relating to the ggml tensor library for machine learning
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24578
opened Jun 13, 2026 by
arthw
Contributor
ggml: optimize concat op by replacing per-element memcpy with row-level memcpy
ggml
changes relating to the ggml tensor library for machine learning
#24575
opened Jun 13, 2026 by
sirohikartik
Contributor
CI: Replace flake8-no-print with flake8-debug and pin repos to hashes
#24572
opened Jun 13, 2026 by
jpodivin
Contributor
CUDA: Add conv3d.
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#24569
opened Jun 13, 2026 by
Sero1000
EXPERIMENT: meta: key external view cache by backend context
ggml
changes relating to the ggml tensor library for machine learning
#24566
opened Jun 13, 2026 by
nycdubliner
•
Draft
[fattn-tune] Add Blackwell MMA config
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#24565
opened Jun 13, 2026 by
yaohengxu
Contributor
[SYCL] Enhance set_rows to support q1_0, mxfp4, nvfp4
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24564
opened Jun 13, 2026 by
arthw
Contributor
CUDA: don't route RDNA3.5 flash attention to the rocWMMA kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#24562
opened Jun 13, 2026 by
liminfei-amd
1 task done
CUDA/HIP: chunked MFMA prefill kernel for GATED_DELTA_NET (CDNA)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#24561
opened Jun 13, 2026 by
jadenmach2
Contributor
ggml-alloc : check realloc result in alloc_tensor_range
ggml
changes relating to the ggml tensor library for machine learning
#24559
opened Jun 13, 2026 by
ricku777-bear
Fix 24486: TP: allows the usage of 8,9,10 gpus for stepfun
#24554
opened Jun 13, 2026 by
krampenschiesser
•
Draft