ik_llama.cpp/tests
firecoperana 1cb7e1bf39
spec : add self speculative decoding, ngram and refactor (#1261)
* spec : add self speculative decoding and ngram-mod and refactor

common : use common_ prefix for common library function

llama : use LLAMA_TOKEN_NULL

spec : add self speculative decoding (no draft model required) + refactor

spec : add ngram-mod

spec : various improvements ton ngram-map + docs

spec : fix the check-rate logic of ngram-simple

common : add common_speculative_is_compat()

spec : simplify time measurement using common_time_meas

refactor common_sampler_init

refactor common_token_to_piece

refactor and fix cur_p bug

clean up

* spec : remove check rate

* spec: show warnings instead of abort

---------

Co-authored-by: firecoperana <firecoperana>
Co-authored-by: Sascha Rogmann <59577610+srogmann@users.noreply.github.com>
2026-02-13 19:04:55 +01:00
..
.gitignore tests : gitignore ggml-common.h 2024-03-09 14:17:11 +02:00
CMakeLists.txt llama : add token matching support to llama-grammar (#1220) 2026-02-03 07:57:17 +02:00
get-model.cpp ci : add model tests + script wrapper (#4586) 2024-01-26 14:18:00 +02:00
get-model.h ci : add model tests + script wrapper (#4586) 2024-01-26 14:18:00 +02:00
run-json-schema-to-grammar.mjs json-schema-to-grammar improvements (+ added to server) (#5978) 2024-03-21 11:50:43 +00:00
test-autorelease.cpp spec : add self speculative decoding, ngram and refactor (#1261) 2026-02-13 19:04:55 +01:00
test-backend-ops.cpp Update mtmd to improve accuracy of M-RoPE (#993) 2025-11-29 07:27:15 +01:00
test-c.c Nomic Vulkan backend (#4456) 2024-01-29 15:50:50 -05:00
test-chat-parser.cpp Add --webui arg to launch llama.cpp new webui (#786) 2025-10-27 14:22:02 +02:00
test-chat-template.cpp Tool calls support from mainline (#723) 2025-09-01 08:38:49 +03:00
test-chat.cpp fix kimi-k2 tool call (#996) 2025-11-24 06:51:16 +01:00
test-double-float.cpp Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
test-function-calls.cpp Fix for Deepseek r1 parsing (#676) 2025-08-08 13:56:44 +03:00
test-grad0.cpp ggml : refactor rope norm/neox (#7634) 2024-06-05 11:29:20 +03:00
test-grammar-integration.cpp llama : add token matching support to llama-grammar (#1220) 2026-02-03 07:57:17 +02:00
test-grammar-llguidance.cpp Tool calls support from mainline (#723) 2025-09-01 08:38:49 +03:00
test-grammar-parser.cpp llama : add token matching support to llama-grammar (#1220) 2026-02-03 07:57:17 +02:00
test-json-partial.cpp Tool calls support from mainline (#723) 2025-09-01 08:38:49 +03:00
test-json-schema-to-grammar.cpp Update grammar (#1023) 2025-11-30 18:45:38 +01:00
test-llama-grammar.cpp llama : add token matching support to llama-grammar (#1220) 2026-02-03 07:57:17 +02:00
test-model-load-cancel.cpp spec : add self speculative decoding, ngram and refactor (#1261) 2026-02-13 19:04:55 +01:00
test-opt.cpp code : normalize enum names (#5697) 2024-02-25 12:09:09 +02:00
test-quantize-fns.cpp Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
test-quantize-perf.cpp Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
test-regex-partial.cpp llama : add token matching support to llama-grammar (#1220) 2026-02-03 07:57:17 +02:00
test-rope.cpp Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00
test-sampling.cpp Merge mainline - Aug 12 2024 (#17) 2024-08-12 15:14:32 +02:00
test-tokenizer-0.cpp spec : add self speculative decoding, ngram and refactor (#1261) 2026-02-13 19:04:55 +01:00
test-tokenizer-0.py py : logging and flake8 suppression refactoring (#7081) 2024-05-05 08:07:48 +03:00
test-tokenizer-0.sh tests : fix test-tokenizer-0.sh 2024-05-28 15:04:09 +03:00
test-tokenizer-1-bpe.cpp Server: refactor and rename functions (#1151) 2026-01-18 08:16:57 +02:00
test-tokenizer-1-spm.cpp Server: refactor and rename functions (#1151) 2026-01-18 08:16:57 +02:00
test-tokenizer-random.py Merge mainline llama.cpp (#3) 2024-07-27 07:55:01 +02:00