mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-04-12 18:25:53 +02:00
The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s |
||
|---|---|---|
| .. | ||
| cmake | ||
| include | ||
| src | ||
| CMakeLists.txt | ||
| ggml_vk_generate_shaders.py | ||