mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-03-03 21:59:44 +01:00
282 B
282 B
llama.cpp/examples/speculative
Demonstration of speculative decoding and tree-based speculative decoding techniques
More info: