ik_llama.cpp/ggml (last commit: 2025-08-23 14:58:14 +03:00)
| Name | Last commit | Date |
|------|-------------|------|
| cmake | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| include | AVX512+AVXVNNI GEMM implementation for quants using Q8_K for activations (#710) | 2025-08-22 06:27:07 +03:00 |
| src | Remove the 16 | 2025-08-23 14:58:14 +03:00 |
| .gitignore | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| CMakeLists.txt | Enable CUDA graphs for MoE models + GPT-OSS support (#689) | 2025-08-15 09:18:07 +03:00 |