ik_llama.cpp/src

Latest commit: f27cd40542 by Kawrakow (2025-05-12 07:49:51 +03:00)
Enable faster prompt processing with mainline llama.cpp GGUFs (#409)

* Enable MLA-3 in crippled GGUFs: WIP
* Enable MLA-3 in crippled GGUFs: seems to work
* Add newly created tensors to model.tensors_by_name, else they don't get run-time repacked

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
File                Last commit                                                           Date
CMakeLists.txt      Be able to repack tensors at run time (#147)                          2024-12-17 14:16:34 +01:00
llama-grammar.cpp   Merge mainline - Aug 12 2024 (#17)                                    2024-08-12 15:14:32 +02:00
llama-grammar.h     Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
llama-impl.h        Add copyright notices (#317)                                          2025-04-07 10:43:26 +02:00
llama-sampling.cpp  Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
llama-sampling.h    Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
llama-vocab.cpp     LlaMA-4 support (text only) (#321)                                    2025-04-10 09:05:21 +02:00
llama-vocab.h       Merge mainline - Aug 12 2024 (#17)                                    2024-08-12 15:14:32 +02:00
llama.cpp           Enable faster prompt processing with mainline llama.cpp GGUFs (#409)  2025-05-12 07:49:51 +03:00
unicode-data.cpp    Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
unicode-data.h      Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00
unicode.cpp         Deepseek V3 support added (#176)                                      2025-01-23 18:24:10 +02:00
unicode.h           Merge mainline llama.cpp (#3)                                         2024-07-27 07:55:01 +02:00