Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feat/model selection UI examples server/ui
#24604 opened Jun 14, 2026 by amoshydra Loading…
[SYCL] support OPs: conv_2d, conv_2d_dw, conv2d_transpose documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24600 opened Jun 14, 2026 by arthw Contributor Loading…
server: add system message prefix feature examples python python script changes server
#24599 opened Jun 14, 2026 by coder543 Loading…
ci: fix vulkan docker images ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#24595 opened Jun 13, 2026 by Kononnable Loading…
spec: support eagle3 for qwen3.5 & 3.6 examples model Model specific server
#24593 opened Jun 13, 2026 by ruixiang63 Contributor Loading…
hexagon: support for op-trace (fine-grain tracing of HVX/HMX/DMA events) ggml changes relating to the ggml tensor library for machine learning Hexagon python python script changes script Script related
#24592 opened Jun 13, 2026 by max-krasnyansky Member Draft
HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#24588 opened Jun 13, 2026 by DEV-DUFORD Loading…
vulkan: add iq4_nl support back to FA ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#24585 opened Jun 13, 2026 by jeffbolznv Contributor Loading…
[SYCL] add to support pool_1d, move pool_1d/2d code to pool.cpp/hpp documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24584 opened Jun 13, 2026 by arthw Contributor Loading…
vulkan: support all backend tests for SQR/SQRT/SIN/COS/CLAMP/LEAKY_RELU/NORM ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related Vulkan Issues specific to the Vulkan backend WebGPU
#24582 opened Jun 13, 2026 by jeffbolznv Contributor Loading…
vulkan: Support gated_delta_net with S_v=16 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#24581 opened Jun 13, 2026 by jeffbolznv Contributor Loading…
vulkan: support more CONCAT types ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#24579 opened Jun 13, 2026 by jeffbolznv Contributor Loading…
[SYCL]fix reorder function crash:GGML_ASSERT(block_num_y % num_subgroups ==0) examples ggml changes relating to the ggml tensor library for machine learning merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24578 opened Jun 13, 2026 by arthw Contributor Loading…
ggml: optimize concat op by replacing per-element memcpy with row-level memcpy ggml changes relating to the ggml tensor library for machine learning
#24575 opened Jun 13, 2026 by sirohikartik Contributor Loading…
CI: Replace flake8-no-print with flake8-debug and pin repos to hashes
#24572 opened Jun 13, 2026 by jpodivin Contributor Loading…
CUDA: Add conv3d. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#24569 opened Jun 13, 2026 by Sero1000 Loading…
EXPERIMENT: meta: key external view cache by backend context ggml changes relating to the ggml tensor library for machine learning
#24566 opened Jun 13, 2026 by nycdubliner Draft
[fattn-tune] Add Blackwell MMA config ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#24565 opened Jun 13, 2026 by yaohengxu Contributor Loading…
[SYCL] Enhance set_rows to support q1_0, mxfp4, nvfp4 documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#24564 opened Jun 13, 2026 by arthw Contributor Loading…
CUDA: don't route RDNA3.5 flash attention to the rocWMMA kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#24562 opened Jun 13, 2026 by liminfei-amd Loading…
1 task done
CUDA/HIP: chunked MFMA prefill kernel for GATED_DELTA_NET (CDNA) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#24561 opened Jun 13, 2026 by jadenmach2 Contributor Loading…
ggml-alloc : check realloc result in alloc_tensor_range ggml changes relating to the ggml tensor library for machine learning
#24559 opened Jun 13, 2026 by ricku777-bear Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.