| File | Last commit message | Last commit date |
| --- | --- | --- |
| ggml-alloc.h | ggml : upgrade init_tensor API to return a ggml_status (llama/11854) | 2025-03-04 21:24:42 +02:00 |
| ggml-backend.h | Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B (llama/13386) | 2025-05-13 13:02:19 +03:00 |
| ggml-blas.h | ggml : build backends as libraries (llama/10256) | 2024-11-15 22:51:53 +02:00 |
| ggml-cann.h | ggml : build backends as libraries (llama/10256) | 2024-11-15 22:51:53 +02:00 |
| ggml-cpp.h | ggml : fix ggml_gallocr_ptr type (#1205) | 2025-04-30 15:20:40 +02:00 |
| ggml-cpu.h | ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (llama/13107) | 2025-05-01 10:39:34 +03:00 |
| ggml-cuda.h | ggml : build backends as libraries (llama/10256) | 2024-11-15 22:51:53 +02:00 |
| ggml-kompute.h | ggml : build backends as libraries (llama/10256) | 2024-11-15 22:51:53 +02:00 |
| ggml-metal.h | repo : update links to new url (llama/11886) | 2025-02-25 13:33:09 +02:00 |
| ggml-opencl.h | Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) | 2024-12-17 19:23:40 +02:00 |
| ggml-opt.h | mnist: fix segmentation fault (#1227) | 2025-05-19 09:33:35 +02:00 |
| ggml-rpc.h | rpc : do not wait for response when sending RPC_CMD_SET_TENSOR (llama/12943) | 2025-05-01 10:39:34 +03:00 |
| ggml-sycl.h | ggml : build backends as libraries (llama/10256) | 2024-11-15 22:51:53 +02:00 |
| ggml-vulkan.h | vulkan: Make Vulkan optional at runtime (#11493) (llama/11494) | 2025-02-12 22:00:20 +02:00 |
| ggml.h | ggml-cpu : "align corners" for bilinear upscale/downscale (#1285) | 2025-07-01 09:11:00 +02:00 |
| gguf.h | GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030) | 2025-01-14 09:36:36 +02:00 |