ik_llama.cpp/ggml
Kawrakow 7a8abe29f7
Minor (~2%) iq2_ks TG performance improvement on CUDA (#468)
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-06-01 15:24:33 +03:00
..
cmake Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
include Trellis quants with CPU inference (#441) 2025-05-23 09:17:52 +03:00
src Minor (~2%) iq2_ks TG performance improvement on CUDA (#468) 2025-06-01 15:24:33 +03:00
.gitignore Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
CMakeLists.txt Option to enable disable the IQK CPU FA kernels (#429) 2025-05-17 11:21:58 +03:00