Mirror of https://github.com/ggerganov/llama.cpp (synced 2026-04-12 02:05:38 +02:00)
llama.cpp/examples/speculative
Demonstration of speculative decoding and tree-based speculative decoding techniques.
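As a rough illustration of the basic (non-tree) idea, the sketch below shows greedy speculative decoding with toy models: a cheap draft model proposes a few tokens, the target model checks them, and the longest agreeing prefix is kept. Everything here is an assumption made for illustration: `Token`, `Model`, `toy_target`, `toy_draft`, `greedy_generate` and `speculative_generate` are hypothetical names, not part of the llama.cpp API, and the target is queried one position at a time for clarity, whereas a real implementation would typically verify all drafted positions in a single batch.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

using Token = int;
// Hypothetical model interface: returns the greedy next token for a context.
using Model = std::function<Token(const std::vector<Token>&)>;

// Toy target model (assumption): a deterministic function of the last token.
Token toy_target(const std::vector<Token>& ctx) {
    return ctx.empty() ? 0 : (ctx.back() * 3 + 1) % 7;
}

// Toy draft model (assumption): agrees with the target except after token 4.
Token toy_draft(const std::vector<Token>& ctx) {
    if (!ctx.empty() && ctx.back() == 4) return 0; // deliberate disagreement
    return toy_target(ctx);
}

// Baseline: plain greedy decoding with a single model.
std::vector<Token> greedy_generate(const Model& model, std::vector<Token> ctx,
                                   std::size_t n_new) {
    for (std::size_t i = 0; i < n_new; ++i) ctx.push_back(model(ctx));
    return ctx;
}

// Greedy speculative decoding: draft n_draft tokens, then verify them
// against the target and keep only what the target agrees with.
std::vector<Token> speculative_generate(const Model& target, const Model& draft,
                                        std::vector<Token> ctx,
                                        std::size_t n_new, std::size_t n_draft) {
    const std::size_t goal = ctx.size() + n_new;
    while (ctx.size() < goal) {
        // 1. Draft phase: the cheap model extends a scratch copy of the context.
        std::vector<Token> scratch = ctx;
        std::vector<Token> proposal;
        for (std::size_t i = 0; i < n_draft; ++i) {
            const Token t = draft(scratch);
            proposal.push_back(t);
            scratch.push_back(t);
        }
        // 2. Verify phase: the target's token is always the one appended.
        //    On the first mismatch the rest of the draft is discarded.
        bool all_accepted = true;
        for (const Token proposed : proposal) {
            if (ctx.size() == goal) break;
            const Token verified = target(ctx);
            ctx.push_back(verified);
            if (verified != proposed) { all_accepted = false; break; }
        }
        // 3. If every drafted token was accepted, the target contributes one
        //    extra token, which also guarantees progress when n_draft == 0.
        if (all_accepted && ctx.size() < goal) ctx.push_back(target(ctx));
    }
    return ctx;
}
```

Because only tokens produced by the target are ever appended, this greedy variant yields the same sequence as `greedy_generate(toy_target, ...)`; the draft model only affects how many target steps could be verified together. Sampling-based acceptance and tree-based drafting are not covered by this sketch.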
More info: