InvokeAI/invokeai/backend/quantization
Kevin Turner 8bd52ed744 fix: improve gguf performance with torch.compile
pytorch 2.7 does not implement `set.__contains__`, so make this a list instead.

See https://github.com/pytorch/pytorch/issues/145761
2025-05-22 13:42:09 +10:00
..
gguf fix: improve gguf performance with torch.compile 2025-05-22 13:42:09 +10:00
scripts Rename params for flux and flux vae, add comments explaining use of the config_path in model config 2024-08-26 20:17:50 -04:00
__init__.py Move requantize.py to the quatnization/ dir. 2024-08-26 20:17:50 -04:00
bnb_llm_int8.py Simplify the state management in InvokeLinear8bitLt and add unit tests. This is in preparation for wrapping it to support streaming of weights from cpu to gpu. 2024-12-24 14:32:11 +00:00
bnb_nf4.py Install sub directories with folders correctly, ensure consistent dtype of tensors in flux pipeline and vae 2024-08-26 20:17:50 -04:00