mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-04-01 21:05:43 +02:00
Since the prefill length is not fixed, graphs constructed for the prefill stage cannot be reused. For this reason, ACL graph execution is disabled by default during prefill. |
||
|---|---|---|
| .. | ||
| BLIS.md | ||
| CANN.md | ||
| CUDA-FEDORA.md | ||
| OPENCL.md | ||
| SYCL.md | ||