mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-04-16 20:26:03 +02:00
282 B
282 B
llama.cpp/examples/speculative
Demonstration of speculative decoding and tree-based speculative decoding techniques
More info: