ik_llama.cpp/examples/quantize-stats
Iwan Kawrakow f6863cfa1b bitnet: add 2 bpw quantization
The scalar dot product already chieves 37 t/s for TG!
2024-06-22 12:02:51 +03:00
..
CMakeLists.txt build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 2024-06-13 00:41:52 +01:00
quantize-stats.cpp bitnet: add 2 bpw quantization 2024-06-22 12:02:51 +03:00