mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-04-05 06:45:54 +02:00
llama.cpp/examples/speculative
Demonstration of speculative decoding and tree-based speculative decoding techniques
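As a rough illustration of the basic idea (not the code in this example, and not the llama.cpp API), the sketch below shows greedy speculative decoding with a single draft sequence: a cheap draft model proposes a few tokens, the target model checks them, and the longest agreeing prefix is kept. The names `NextTokenFn`, `SpecStats`, and `speculative_generate` are hypothetical, the "models" are plain callables that return the greedy next token, and sampling-based acceptance and tree-based drafting are left out.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

using Token  = int;
using Tokens = std::vector<Token>;

// Hypothetical model interface (an assumption for this sketch):
// given the context so far, return the greedy next token.
using NextTokenFn = std::function<Token(const Tokens &)>;

// Counters for how many drafted tokens the target agreed with.
struct SpecStats {
    std::size_t drafted  = 0;
    std::size_t accepted = 0;
};

// Generate n_new tokens after ctx. The result is the same as greedy
// decoding with the target alone; the draft only decides how many tokens
// are settled per verification round.
Tokens speculative_generate(const NextTokenFn & target,
                            const NextTokenFn & draft,
                            Tokens              ctx,
                            std::size_t         n_new,
                            std::size_t         n_draft,
                            SpecStats *         stats = nullptr) {
    assert(n_draft > 0);
    const std::size_t n_final = ctx.size() + n_new;

    while (ctx.size() < n_final) {
        // 1. The draft model proposes n_draft tokens autoregressively.
        Tokens proposal;
        Tokens draft_ctx = ctx;
        for (std::size_t i = 0; i < n_draft; ++i) {
            const Token t = draft(draft_ctx);
            proposal.push_back(t);
            draft_ctx.push_back(t);
        }
        if (stats) {
            stats->drafted += proposal.size();
        }

        // 2. The target model verifies the proposal position by position.
        //    A real implementation would score all positions in one batched
        //    forward pass; calling the target once per position keeps the
        //    sketch short.
        for (const Token t : proposal) {
            if (ctx.size() >= n_final) {
                break;
            }
            const Token expected = target(ctx);
            ctx.push_back(expected); // the target's token is always kept
            if (expected != t) {
                break;               // first mismatch: drop the rest of the draft
            }
            if (stats) {
                ++stats->accepted;
            }
        }
    }
    return ctx;
}
```

Calling `speculative_generate` with two toy callables that mostly agree returns exactly what the target would produce on its own, while `SpecStats` reports how many drafted tokens were accepted. Tree-based variants extend this by drafting several candidate continuations and verifying them together, which this sketch does not attempt.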
More info: