whisper.cpp/ggml
Srihari-mcw 8058f19d0b ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (llama/12332)
* Add block interleaving support for Q4_K quantization

* Remove whitespaces and fix CI/CD issues

* Update pointer of bsums from int16_t to const int16_t

* Add vector version of quantize_q8_K_4x8 function

* Update code formatting based on review comments
2025-03-27 11:06:03 +02:00
..
cmake cmake: Comment out GGML_BIN_DIR for now (ggml/1139) 2025-03-27 11:06:03 +02:00
include llama: Add support for RWKV v7 architecture (llama/12412) 2025-03-27 11:06:03 +02:00
src ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (llama/12332) 2025-03-27 11:06:03 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt SYCL: using graphs is configurable by environment variable and compile option (llama/12371) 2025-03-27 11:06:03 +02:00