mirror of
https://github.com/ggerganov/llama.cpp
synced 2026-03-08 08:09:22 +01:00
* imatrix: load * imatrix: WIP * imatrix: Add Q2_K quantization * imatrix: also guard against Q2_K_S quantization without importance matrix * imatrix: guard even more against low-bit quantization misuse --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com> |
||
|---|---|---|
| .. | ||
| benchmark-matmult.cpp | ||
| CMakeLists.txt | ||