| .. |
|
ggml-alloc.h
|
ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
|
2025-03-04 21:24:42 +02:00 |
|
ggml-backend.h
|
Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B (llama/13386)
|
2025-05-13 13:02:19 +03:00 |
|
ggml-blas.h
|
ggml : build backends as libraries (llama/10256)
|
2024-11-15 22:51:53 +02:00 |
|
ggml-cann.h
|
ggml : build backends as libraries (llama/10256)
|
2024-11-15 22:51:53 +02:00 |
|
ggml-cpp.h
|
ggml : fix ggml_gallocr_ptr type (#1205)
|
2025-04-30 15:20:40 +02:00 |
|
ggml-cpu.h
|
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (llama/13107)
|
2025-05-01 10:39:34 +03:00 |
|
ggml-cuda.h
|
ggml : build backends as libraries (llama/10256)
|
2024-11-15 22:51:53 +02:00 |
|
ggml-kompute.h
|
ggml : build backends as libraries (llama/10256)
|
2024-11-15 22:51:53 +02:00 |
|
ggml-metal.h
|
repo : update links to new url (llama/11886)
|
2025-02-25 13:33:09 +02:00 |
|
ggml-opencl.h
|
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693)
|
2024-12-17 19:23:40 +02:00 |
|
ggml-opt.h
|
mnist: fix segmentation fault (#1227)
|
2025-05-19 09:33:35 +02:00 |
|
ggml-rpc.h
|
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR (llama/12943)
|
2025-05-01 10:39:34 +03:00 |
|
ggml-sycl.h
|
ggml : build backends as libraries (llama/10256)
|
2024-11-15 22:51:53 +02:00 |
|
ggml-vulkan.h
|
vulkan: Make Vulkan optional at runtime (#11493). (llama/11494)
|
2025-02-12 22:00:20 +02:00 |
|
ggml.h
|
llama/ggml: add LLM training support (llama/10544)
|
2025-05-13 13:02:19 +03:00 |
|
gguf.h
|
GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030)
|
2025-01-14 09:36:36 +02:00 |