ik_llama.cpp

Kawrakow 7a8abe29f7 Minor (~2%) iq2_ks TG performance improvement on CUDA (#468) Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>	2025-06-01 15:24:33 +03:00
cmake	Merge mainline llama.cpp (#3)	2024-07-27 07:55:01 +02:00
include	Trellis quants with CPU inference (#441)	2025-05-23 09:17:52 +03:00
src	Minor (~2%) iq2_ks TG performance improvement on CUDA (#468)	2025-06-01 15:24:33 +03:00
.gitignore	Merge mainline llama.cpp (#3)	2024-07-27 07:55:01 +02:00
CMakeLists.txt	Option to enable disable the IQK CPU FA kernels (#429)	2025-05-17 11:21:58 +03:00