ik_llama.cpp/src
Nexes the Elder c7ecd4e23a
Legacy quants conversion schemes in convert_hf_to_gguf.py (#449)
* Legacy quants conversion schemes in convert_hf_to_gguf.py

This is notably intended to produce smaller conversions for generating an iMatrix file.

`Q4_0` and `Q4_1` conversions here use q5_0 for the embeddings, output, attn_k and attn_v tensors.
`Q5_0` and `Q5_1` conversions here use q8_0 for the embeddings, output, attn_k and attn_v tensors.
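The scheme above can be sketched as a small per-tensor selection helper. This is a minimal illustration, not the PR's actual code; the function and constant names below are hypothetical, and the real logic lives in convert_hf_to_gguf.py:

```python
# Hypothetical sketch of the per-tensor quant selection described above.
# Substring matching against GGUF tensor names is an assumption for illustration.
SENSITIVE_SUBSTRINGS = ("token_embd", "output.weight", "attn_k", "attn_v")

def pick_quant_for_tensor(tensor_name: str, base_ftype: str) -> str:
    """Return the quant type to use for one tensor under a legacy-quant scheme."""
    if any(s in tensor_name for s in SENSITIVE_SUBSTRINGS):
        if base_ftype in ("Q4_0", "Q4_1"):
            return "Q5_0"   # sensitive tensors kept in q5_0
        if base_ftype in ("Q5_0", "Q5_1"):
            return "Q8_0"   # sensitive tensors kept in q8_0
    return base_ftype       # everything else stays at the base type
```

For example, `pick_quant_for_tensor("blk.0.attn_v.weight", "Q4_0")` yields `"Q5_0"`, while an ordinary FFN tensor stays at `"Q4_0"`.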

Adapted from the following llama.cpp mainline PR: https://github.com/ggml-org/llama.cpp/pull/9022
Original author @chentyjpm

Also adds two previously forgotten mentions of FTYPE IQ3_KL in the llama.cpp file.

* Add a forgotten IQ5_KS case mention
2025-05-24 11:49:10 +03:00
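For context on what these legacy quants are: Q4_0 is ggml's simplest legacy format, storing blocks of 32 values as one shared scale (fp16 in the file) plus a 4-bit code per value. A rough NumPy sketch of the round trip, simplified from ggml's reference quantizer (plain float scale here, and NumPy rounding rather than ggml's exact rounding):

```python
import numpy as np

QK4_0 = 32  # ggml Q4_0 block size

def quantize_q4_0_block(x: np.ndarray):
    """Quantize 32 floats to one shared scale plus 4-bit codes in [0, 15]."""
    assert x.shape == (QK4_0,)
    # ggml scales by the value with the largest magnitude, mapped near code 0 or 15
    max_val = x[np.argmax(np.abs(x))]
    d = max_val / -8.0
    inv_d = 1.0 / d if d != 0.0 else 0.0
    q = np.clip(np.round(x * inv_d) + 8.0, 0, 15).astype(np.uint8)
    return float(d), q

def dequantize_q4_0_block(d: float, q: np.ndarray) -> np.ndarray:
    """Reconstruct approximate floats from the scale and 4-bit codes."""
    return (q.astype(np.float32) - 8.0) * d
```

Q4_1 adds a per-block minimum on top of the scale, and Q5_0/Q5_1 use 5-bit codes, which is why they preserve the sensitive tensors above better than 4-bit blocks.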
| File | Last commit | Date |
|------|-------------|------|
| CMakeLists.txt | Be able to repack tensors at run time (#147) | 2024-12-17 14:16:34 +01:00 |
| llama-grammar.cpp | Merge mainline - Aug 12 2024 (#17) | 2024-08-12 15:14:32 +02:00 |
| llama-grammar.h | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| llama-impl.h | Add copyright notices (#317) | 2025-04-07 10:43:26 +02:00 |
| llama-sampling.cpp | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| llama-sampling.h | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| llama-vocab.cpp | LlaMA-4 support (text only) (#321) | 2025-04-10 09:05:21 +02:00 |
| llama-vocab.h | Merge mainline - Aug 12 2024 (#17) | 2024-08-12 15:14:32 +02:00 |
| llama.cpp | Legacy quants conversion schemes in convert_hf_to_gguf.py (#449) | 2025-05-24 11:49:10 +03:00 |
| unicode-data.cpp | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| unicode-data.h | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |
| unicode.cpp | Deepseek V3 support added (#176) | 2025-01-23 18:24:10 +02:00 |
| unicode.h | Merge mainline llama.cpp (#3) | 2024-07-27 07:55:01 +02:00 |