ik_llama.cpp

mcm007 b2cb4512c5 Create parameters overview (#1269)	2026-02-20 07:20:56 +01:00
* raw parameters.md
* fix small typos in common.cpp
* Update build args in parameters.md
* Update parameters.md
  - format as table
  - sections
* Update README.md
  - quickstart
  - build and run
* Update parameters.md other tools examples
* add PR links
* multiple updates to parameters.md
  - description
  - add jargon section
  - add suggestions from feedbacks
* don't imply that only linux is supported in README.md
* add alias to parameters.md
* Update README.md with recent models and features
* Update parameters.md with latest features
* address suggestions
  - no-ooae
  - placeholder for common commands
  - no-kv-offload
  - llama-sweep-bench
  - placeholder for unique parameters
* specify Linux distro in README.md
..
cmake	Merge mainline llama.cpp (#3)	2024-07-27 07:55:01 +02:00
base64.hpp	llava : expose as a shared library for downstream projects (#3613)	2023-11-07 00:36:23 +03:00
build-info.cpp.in	build : link against build info instead of compiling against it (#3879)	2023-11-02 08:50:16 +02:00
chat-parser-xml-toolcall.cpp	Add chat parser for MiroThinker (#1138)	2026-01-13 08:07:12 +02:00
chat-parser-xml-toolcall.h	Add chat parser for MiroThinker (#1138)	2026-01-13 08:07:12 +02:00
chat-parser.cpp	Add chat parser for MiroThinker (#1138)	2026-01-13 08:07:12 +02:00
chat-parser.h	Refactor chat and server file (#1062)	2025-12-15 08:27:20 +01:00
chat.cpp	llama : add token matching support to llama-grammar (#1220)	2026-02-03 07:57:17 +02:00
chat.h	Add chat parser for MiroThinker (#1138)	2026-01-13 08:07:12 +02:00
CMakeLists.txt	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
common.cpp	Create parameters overview (#1269)	2026-02-20 07:20:56 +01:00
common.h	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
console.cpp	check C++ code with -Wmissing-declarations (#3184)	2023-09-15 15:38:27 -04:00
console.h	gguf : new file format with flexible meta data (beta) (#2398)	2023-08-21 23:07:43 +03:00
json-partial.cpp	common: Generalized XML-style tool-call parsing with streaming support (#958)	2025-11-18 15:29:58 +01:00
json-partial.h	Move minja and nlohmann/json to vendor (#802)	2025-09-27 09:12:35 +02:00
json-schema-to-grammar.cpp	Update grammar (#1023)	2025-11-30 18:45:38 +01:00
json-schema-to-grammar.h	common: Generalized XML-style tool-call parsing with streaming support (#958)	2025-11-18 15:29:58 +01:00
llguidance.cpp	Tool calls support from mainline (#723)	2025-09-01 08:38:49 +03:00
log.cpp	Refactor chat and server file (#1062)	2025-12-15 08:27:20 +01:00
log.h	Server: refactor and rename functions (#1151)	2026-01-18 08:16:57 +02:00
ngram-cache.cpp	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
ngram-cache.h	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
ngram-map.cpp	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
ngram-map.h	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
ngram-mod.cpp	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
ngram-mod.h	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
regex-partial.cpp	llama : add token matching support to llama-grammar (#1220)	2026-02-03 07:57:17 +02:00
regex-partial.h	Tool calls support from mainline (#723)	2025-09-01 08:38:49 +03:00
sampling.cpp	Fix adaptive p sampler bug with string ban (#1287)	2026-02-20 07:11:36 +01:00
sampling.h	Fix adaptive p sampler bug with string ban (#1287)	2026-02-20 07:11:36 +01:00
speculative.cpp	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
speculative.h	spec : add self speculative decoding, ngram and refactor (#1261)	2026-02-13 19:04:55 +01:00
train.cpp	Server: refactor and rename functions (#1151)	2026-01-18 08:16:57 +02:00
train.h	sync : ggml (backend v2) (#3912)	2023-11-13 14:16:23 +02:00