InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2026-04-20 13:45:48 +02:00

Author	SHA1	Message	Date
Lincoln Stein	b42274a57e	Feat[model support]: Qwen Image — full pipeline with edit, generate LoRA, GGUF, quantization, and UI (#9000)	2026-04-12 14:39:13 +02:00
Jonathan	ee600973ed	Broaden text encoder partial-load recovery (#9034)	2026-04-09 20:09:40 -04:00
4pointoh	f0d09c34a8	feat: add Anima model support (#8961 ) * feat: add Anima model support * schema * image to image * regional guidance * loras * last fixes * tests * fix attributions * fix attributions * refactor to use diffusers reference * fix an additional lora type * some adjustments to follow flux 2 paper implementation * use t5 from model manager instead of downloading * make lora identification more reliable * fix: resolve lint errors in anima module Remove unused variable, fix import ordering, inline dict() call, and address minor lint issues across anima-related files. * Chore Ruff format again * fix regional guidance error * fix(anima): validate unexpected keys after strict=False checkpoint loading Capture the load_state_dict result and raise RuntimeError on unexpected keys (indicating a corrupted or incompatible checkpoint), while logging a warning for missing keys (expected for inv_freq buffers regenerated at runtime). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): make model loader submodel fields required instead of Optional Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): add Classification.Prototype to LoRA loaders, fix exception types Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): fix replace-all in key conversion, warn on DoRA+LoKR, unify grouping functions - Use key.replace(old, new, 1) in _convert_kohya_unet_key and _convert_kohya_te_key to avoid replacing multiple occurrences - Upgrade DoRA+LoKR dora_scale strip from logger.debug to logger.warning since it represents data loss - Replace _group_kohya_keys and _group_by_layer with a single _group_keys_by_layer function parameterized by extra_suffixes, with _KOHYA_KNOWN_SUFFIXES and _PEFT_EXTRA_SUFFIXES constants - Add test_empty_state_dict_returns_empty_model to verify empty input produces a model with no layers Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): add safety cap for Qwen3 sequence length 
to prevent OOM Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): add denoising range validation, fix closure capture, add edge case tests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): add T5 to metadata, fix dead code, decouple scheduler type guard Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(anima): update VAE field description for required field Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: regenerate frontend types after upstream merge Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: ruff format anima_denoise.py Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(anima): add T5 encoder metadata recall handler The T5 encoder was added to generation metadata but had no recall handler, so it wasn't restored when recalling from metadata. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore(frontend): add regression test for buildAnimaGraph Add tests for CFG gating (negative conditioning omitted when cfgScale <= 1) and basic graph structure (model loader, text encoder, denoise nodes). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * only show 0.6b for anima * dont show 0.6b for other models * schema * Anima preview 3 * fix ci --------- Co-authored-by: Your Name <you@example.com> Co-authored-by: kappacommit <samwolfe40@gmail.com> Co-authored-by: Alexander Eichhorn <alex@eichhorn.dev> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2026-04-09 12:04:11 -04:00
Jonathan	dc5007fe95	Fix/model cache Qwen/CogView4 cancel repair (#8959) * Repair partially loaded Qwen models after cancel to avoid device mismatches * ruff * Repair CogView4 text encoder after canceled partial loads * Avoid MPS CI crash in repair regression test * Fix MPS device assertion in repair test	2026-03-15 10:04:15 -04:00
Lincoln Stein	dfc66b7142	Feature: Add FLUX.2 LOKR model support (detection and loading) (#8909) * Add FLUX.2 LOKR model support (detection and loading) (#88) Fix BFL LOKR models being misidentified as AIToolkit format Fix alpha key warning in LOKR QKV split layers Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: lstein <111189+lstein@users.noreply.github.com> * Fix BFL→diffusers key mapping for non-block layers in FLUX.2 LoRA/LoKR BFL's FLUX.2 model uses different names than diffusers' Flux2Transformer2DModel for top-level modules (embedders, modulations, output layers). The existing conversion only handled block-level renames (double_blocks→transformer_blocks), causing "Failed to find module" warnings for non-block LoRA keys like img_in, txt_in, modulation.lin, time_in, and final_layer. --------- Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com> Co-authored-by: lstein <111189+lstein@users.noreply.github.com> Co-authored-by: Alexander Eichhorn <alex@eichhorn.dev>	2026-02-27 00:45:13 +00:00
Harikrishna KP	ddaa12b0fd	Fix bare except clauses and mutable default arguments (#8871) * Fix bare except clauses and mutable default arguments Replace bare `except:` with `except Exception:` in sqlite_database.py and mlsd/utils.py to avoid catching KeyboardInterrupt and SystemExit, which can prevent graceful shutdowns and mask critical errors (PEP 8 E722). Replace mutable default arguments (lists) with None in imwatermark/vendor.py to prevent shared state between calls, which is a known Python gotcha that can cause subtle bugs when default mutable objects are modified in place. * add tests for mutable defaults and bare except fixes * Simplify exception propagation tests * Remove unused db initialization in error propagation tests Removed unused database initialization in tests for KeyboardInterrupt and SystemExit. --------- Co-authored-by: Jonathan <34005131+JPPhoto@users.noreply.github.com> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2026-02-22 23:25:15 -05:00
Jonathan	bacdfecb13	Add dype area option (#8844) * Add DyPE area option * Added tests and fixed frontend build * Made more pythonic	2026-02-06 00:55:29 +05:30
Alexander Eichhorn	736f4ffeb1	fix(ui): improve DyPE field ordering and add 'On' preset option (#8793 ) * fix(ui): improve DyPE field ordering and add 'On' preset option - Add ui_order to DyPE fields (100, 101, 102) to group them at bottom of node - Change DyPEPreset from Enum to Literal type for proper frontend dropdown support - Add ui_choice_labels for human-readable dropdown options - Add new 'On' preset to enable DyPE regardless of resolution - Fix frontend input field sorting to respect ui_order (unordered first, then ordered) - Bump flux_denoise node version to 4.4.0 * Chore Ruff check fix * fix(flux): remove .value from dype_preset logging DyPEPreset is now a Literal type (string) instead of an Enum, so .value is no longer needed. * fix(tests): update DyPE tests for Literal type change Update test imports and assertions to use string constants instead of Enum attributes since DyPEPreset is now a Literal type. * feat(flux): add DyPE scale and exponent controls to Linear UI - Add dype_scale (λs) and dype_exponent (λt) sliders to generation settings - Add Zod schemas and parameter types for DyPE scale/exponent - Pass custom values from Linear UI to flux_denoise node - Fix bug where DyPE was enabled even when preset was "off" - Add enhanced logging showing all DyPE parameters when enabled * fix(flux): apply DyPE scale/exponent and add metadata recall - Fix DyPE scale and exponent parameters not being applied in frequency computation (compute_vision_yarn_freqs, compute_yarn_freqs now call get_timestep_mscale) - Add metadata handlers for dype_scale and dype_exponent to enable recall from generated images - Add i18n translations referencing existing parameter labels * fix(flux): apply DyPE scale/exponent and add metadata recall - Fix DyPE scale and exponent parameters not being applied in frequency computation (compute_vision_yarn_freqs, compute_yarn_freqs now call get_timestep_mscale) - Add metadata handlers for dype_scale and dype_exponent to enable recall from 
generated images - Add i18n translations referencing existing parameter labels * feat(ui): show DyPE scale/exponent only when preset is "on" - Hide scale/exponent controls in UI when preset is not "on" - Only parse/recall scale/exponent from metadata when preset is "on" - Prevents confusion where custom values override preset behavior * fix(dype): only allow custom scale/exponent with 'on' preset Presets (auto, 4k) now use their predefined values and ignore any custom_scale/custom_exponent parameters. Only the 'on' preset allows manual override of these values. This matches the frontend UI behavior where the scale/exponent fields are only shown when 'On' is selected. * refactor(dype): rename 'on' preset to 'manual' Rename the 'on' DyPE preset to 'manual' to better reflect its purpose: allowing users to manually configure scale and exponent values. Updated in: - Backend presets (DYPE_PRESET_ON -> DYPE_PRESET_MANUAL) - Frontend UI labels and options - Redux slice type definitions - Zod schema validation - Tests * refactor(dype): rename 'on' preset to 'manual' Rename the 'on' DyPE preset to 'manual' to better reflect its purpose: allowing users to manually configure scale and exponent values. Updated in: - Backend presets (DYPE_PRESET_ON -> DYPE_PRESET_MANUAL) - Frontend UI labels and options - Redux slice type definitions - Zod schema validation - Tests * fix(dype): update remaining 'on' references to 'manual' - Update docstrings, comments, and error messages to use 'manual' preset name - Simplify FLUX graph builder to always send dype_scale/dype_exponent - Fix UI condition to show DyPE controls for 'manual' preset --------- Co-authored-by: Jonathan <34005131+JPPhoto@users.noreply.github.com> Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2026-01-30 01:28:28 +00:00
Alexander Eichhorn	cff20b45f3	Feature: Add DyPE (Dynamic Position Extrapolation) support to FLUX models for improved high-resolution image generation (#8763) * docs: add DyPE implementation plan for FLUX high-resolution generation Add detailed plan for porting ComfyUI-DyPE (Dynamic Position Extrapolation) to InvokeAI, enabling 4K+ image generation with FLUX models without training. Estimated effort: 5-7 developer days. * docs: update DyPE plan with design decisions - Integrate DyPE directly into FluxDenoise (no separate node) - Add 4K preset and "auto" mode for automatic activation - Confirm FLUX Schnell support (same base resolution as Dev) * docs: add activation threshold for DyPE auto mode FLUX can handle resolutions up to ~1.5x natively without artifacts. Set activation_threshold=1536 so DyPE only kicks in above that. * feat(flux): implement DyPE for high-resolution generation Add Dynamic Position Extrapolation (DyPE) support to FLUX models, enabling artifact-free generation at 4K+ resolutions. New files: - invokeai/backend/flux/dype/base.py: DyPEConfig and scaling calculations - invokeai/backend/flux/dype/rope.py: DyPE-enhanced RoPE functions - invokeai/backend/flux/dype/embed.py: DyPEEmbedND position embedder - invokeai/backend/flux/dype/presets.py: Presets (off, auto, 4k) - invokeai/backend/flux/extensions/dype_extension.py: Pipeline integration Modified files: - invokeai/backend/flux/denoise.py: Add dype_extension parameter - invokeai/app/invocations/flux_denoise.py: Add UI parameters UI parameters: - dype_preset: off | auto | 4k - dype_scale: Custom magnitude override (0-8) - dype_exponent: Custom decay speed override (0-1000) Auto mode activates DyPE for resolutions > 1536px.
Based on: https://github.com/wildminder/ComfyUI-DyPE * feat(flux): add DyPE preset selector to Linear UI Add Linear UI integration for FLUX DyPE (Dynamic Position Extrapolation): - Add ParamFluxDypePreset component with Off/Auto/4K options - Integrate preset selector in GenerationSettingsAccordion for FLUX models - Add state management (paramsSlice, types) for fluxDypePreset - Add dype_preset to FLUX denoise graph builder and metadata - Add translations for DyPE preset label and popover - Add zFluxDypePresetField schema definition Fix DyPE frequency computation: - Remove incorrect mscale multiplication on frequencies - Use only NTK-aware theta scaling for position extrapolation * feat(flux): add DyPE preset to metadata recall - Add FluxDypePreset handler to ImageMetadataHandlers - Parse dype_preset from metadata and dispatch setFluxDypePreset on recall - Add translation key metadata.dypePreset * chore: remove dype-implementation-plan.md Remove internal planning document from the branch. * chore(flux): bump flux_denoise version to 4.3.0 Version bump for dype_preset field addition. * chore: ruff check fix * chore: ruff format * Fix truncated DyPE label in advanced options UI Shorten the label from "DyPE (High-Res)" to "DyPE" to prevent text truncation in the sidebar. The high-resolution context is preserved in the informational popover tooltip. * Add DyPE preset to recall parameters in image viewer The dype_preset metadata was being saved but not displayed in the Recall Parameters tab. Add FluxDypePreset handler to ImageMetadataActions so users can see and recall this parameter. --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jonathan <34005131+JPPhoto@users.noreply.github.com>	2026-01-26 23:54:44 -05:00
Lincoln Stein	5b69403ba8	Merge branch 'main' into copilot/add-unload-model-option	2025-12-24 15:39:46 -05:00
Alexander Eichhorn	ac245cbf6c	feat(backend): add support for xlabs Flux LoRA format (#8686) Add support for loading Flux LoRA models in the xlabs format, which uses keys like `double_blocks.X.processor.{qkv|proj}_lora{1|2}.{down|up}.weight`. The xlabs format maps: - lora1 -> img_attn (image attention stream) - lora2 -> txt_attn (text attention stream) - qkv -> query/key/value projection - proj -> output projection Changes: - Add FluxLoRAFormat.XLabs enum value - Add flux_xlabs_lora_conversion_utils.py with detection and conversion - Update formats.py to detect xlabs format - Update lora.py loader to handle xlabs format - Update model probe to accept recognized Flux LoRA formats - Add unit tests for xlabs format detection and conversion Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2025-12-24 20:18:11 +00:00
copilot-swe-agent[bot]	b7afd9b5b3	Fix test failures caused by MagicMock TypeError Configure mock logger to return a valid log level for getEffectiveLevel() to prevent TypeError when comparing with logging.DEBUG constant. The issue was that ModelCache._log_cache_state() checks self._logger.getEffectiveLevel() > logging.DEBUG, and when the logger is a MagicMock without configuration, getEffectiveLevel() returns another MagicMock, causing a TypeError when compared with an int. Fixes all 4 test failures in test_model_cache_timeout.py Co-authored-by: lstein <111189+lstein@users.noreply.github.com>	2025-12-24 05:42:45 +00:00
Lincoln Stein	9d1de81fe2	(style) correct ruff formatting error	2025-12-24 00:19:25 -05:00
copilot-swe-agent[bot]	8d76b4e4d4	Fix ruff whitespace errors and improve timeout logging - Remove all trailing whitespace (W293 errors) - Add debug logging when timeout fires but activity detected - Add debug logging when timeout fires but cache is empty - Only log "Clearing model cache" message when actually clearing - Prevents misleading timeout messages during active generation Co-authored-by: lstein <111189+lstein@users.noreply.github.com>	2025-12-24 04:05:57 +00:00
copilot-swe-agent[bot]	c3217d8a08	Address code review feedback - Remove unused variable in test - Add clarifying comment for daemon thread setting - Add detailed comment explaining cache clearing with 1000 GB value - Improve code documentation Co-authored-by: lstein <111189+lstein@users.noreply.github.com>	2025-12-24 00:27:39 +00:00
copilot-swe-agent[bot]	75a14e2a4b	Add unit tests for model cache timeout functionality - Created test_model_cache_timeout.py with comprehensive tests - Tests timeout clearing behavior - Tests activity resetting timeout - Tests no-timeout default behavior - Tests shutdown canceling timers Co-authored-by: lstein <111189+lstein@users.noreply.github.com>	2025-12-24 00:24:31 +00:00
psychedelicious	454d05bbde	refactor: model manager v3 (#8607 ) * feat(mm): add UnknownModelConfig * refactor(ui): move model categorisation-ish logic to central location, simplify model manager models list * refactor(ui)refactor(ui): more cleanup of model categories * refactor(ui): remove unused excludeSubmodels I can't remember what this was for and don't see any reference to it. Maybe it's just remnants from a previous implementation? * feat(nodes): add unknown as model base * chore(ui): typegen * feat(ui): add unknown model base support in ui * feat(ui): allow changing model type in MM, fix up base and variant selects * feat(mm): omit model description instead of making it "base type filename model" * feat(app): add setting to allow unknown models * feat(ui): allow changing model format in MM * feat(app): add the installed model config to install complete events * chore(ui): typegen * feat(ui): toast warning when installed model is unidentified * docs: update config docstrings * chore(ui): typegen * tests(mm): fix test for MM, leave the UnknownModelConfig class in the list of configs * tidy(ui): prefer types from zod schemas for model attrs * chore(ui): lint * fix(ui): wrong translation string * feat(mm): normalized model storage Store models in a flat directory structure. Each model is in a dir named its unique key (a UUID). Inside that dir is either the model file or the model dir. 
* feat(mm): add migration to flat model storage * fix(mm): normalized multi-file/diffusers model installation no worky now worky * refactor: port MM probes to new api - Add concept of match certainty to new probe - Port CLIP Embed models to new API - Fiddle with stuff * feat(mm): port TIs to new API * tidy(mm): remove unused probes * feat(mm): port spandrel to new API * fix(mm): parsing for spandrel * fix(mm): loader for clip embed * fix(mm): tis use existing weight_files method * feat(mm): port vae to new API * fix(mm): vae class inheritance and config_path * tidy(mm): patcher types and import paths * feat(mm): better errors when invalid model config found in db * feat(mm): port t5 to new API * feat(mm): make config_path optional * refactor(mm): simplify model classification process Previously, we had a multi-phase strategy to identify models from their files on disk: 1. Run each model config classes' `matches()` method on the files. It checks if the model could possibly be an identified as the candidate model type. This was intended to be a quick check. Break on the first match. 2. If we have a match, run the config class's `parse()` method. It derive some additional model config attrs from the model files. This was intended to encapsulate heavier operations that may require loading the model into memory. 3. Derive the common model config attrs, like name, description, calculate the hash, etc. Some of these are also heavier operations. This strategy has some issues: - It is not clear how the pieces fit together. There is some back-and-forth between different methods and the config base class. It is hard to trace the flow of logic until you fully wrap your head around the system and therefore difficult to add a model architecture to the probe. - The assumption that we could do quick, lightweight checks before heavier checks is incorrect. We often _must_ load the model state dict in the `matches()` method. 
So there is no practical perf benefit to splitting up the responsibility of `matches()` and `parse()`. - Sometimes we need to do the same checks in `matches()` and `parse()`. In these cases, splitting the logic is has a negative perf impact because we are doing the same work twice. - As we introduce the concept of an "unknown" model config (i.e. a model that we cannot identify, but still record in the db; see #8582), we will _always_ run _all_ the checks for every model. Therefore we need not try to defer heavier checks or resource-intensive ops like hashing. We are going to do them anyways. - There are situations where a model may match multiple configs. One known case are SD pipeline models with merged LoRAs. In the old probe API, we relied on the implicit order of checks to know that if a model matched for pipeline _and_ LoRA, we prefer the pipeline match. But, in the new API, we do not have this implicit ordering of checks. To resolve this in a resilient way, we need to get all matches up front, then use tie-breaker logic to figure out which should win (or add "differential diagnosis" logic to the matchers). - Field overrides weren't handled well by this strategy. They were only applied at the very end, if a model matched successfully. This means we cannot tell the system "Hey, this model is type X with base Y. Trust me bro.". We cannot override the match logic. As we move towards letting users correct mis-identified models (see #8582), this is a requirement. We can simplify the process significantly and better support "unknown" models. Firstly, model config classes now have a single `from_model_on_disk()` method that attempts to construct an instance of the class from the model files. This replaces the `matches()` and `parse()` methods. If we fail to create the config instance, a special exception is raised that indicates why we think the files cannot be identified as the given model config class. 
Next, the flow for model identification is a bit simpler: - Derive all the common fields up-front (name, desc, hash, etc). - Merge in overrides. - Call `from_model_on_disk()` for every config class, passing in the fields. Overrides are handled in this method. - Record the results for each config class and choose the best one. The identification logic is a bit more verbose, with the special exceptions and handling of overrides, but it is very clear what is happening. The one downside I can think of for this strategy is we do need to check every model type, instead of stopping at the first match. It's a bit less efficient. In practice, however, this isn't a hot code path, and the improved clarity is worth far more than perf optimizations that the end user will likely never notice. * refactor(mm): remove unused methods in config.py * refactor(mm): add model config parsing utils * fix(mm): abstractmethod bork * tidy(mm): clarify that model id utils are private * fix(mm): fall back to UnknownModelConfig correctly * feat(mm): port CLIPVisionDiffusersConfig to new api * feat(mm): port SigLIPDiffusersConfig to new api * feat(mm): make match helpers more succint * feat(mm): port flux redux to new api * feat(mm): port ip adapter to new api * tidy(mm): skip optimistic override handling for now * refactor(mm): continue iterating on config * feat(mm): port flux "control lora" and t2i adapter to new api * tidy(ui): use Extract to get model config types * fix(mm): t2i base determination * feat(mm): port cnet to new api * refactor(mm): add config validation utils, make it all consistent and clean * feat(mm): wip port of main models to new api * feat(mm): wip port of main models to new api * feat(mm): wip port of main models to new api * docs(mm): add todos * tidy(mm): removed unused model merge class * feat(mm): wip port main models to new api * tidy(mm): clean up model heuristic utils * tidy(mm): clean up ModelOnDisk caching * tidy(mm): flux lora format util * refactor(mm): make 
config classes narrow Simpler logic to identify, less complexity to add new model, fewer useless attrs that do not relate to the model arch, etc * refactor(mm): diffusers loras w * feat(mm): consistent naming for all model config classes * fix(mm): tag generation & scattered probe fixes * tidy(mm): consistent class names * refactor(mm): split configs into separate files * docs(mm): add comments for identification utils * chore(ui): typegen * refactor(mm): remove legacy probe, new configs dir structure, update imports * fix(mm): inverted condition * docs(mm): update docsstrings in factory.py * docs(mm): document flux variant attr * feat(mm): add helper method for legacy configs * feat(mm): satisfy type checker in flux denoise * docs(mm): remove extraneous comment * fix(mm): ensure unknown model configs get unknown attrs * fix(mm): t5 identification * fix(mm): sdxl ip adapter identification * feat(mm): more flexible config matching utils * fix(mm): clip vision identification * feat(mm): add sanity checks before probing paths * docs(mm): add reminder for self for field migrations * feat(mm): clearer naming for main config class hierarchy * feat(mm): fix clip vision starter model bases, add ref to actual models * feat(mm): add model config schema migration logic * fix(mm): duplicate import * refactor(mm): split big migration into 3 Split the big migration that did all of these things into 3: - Migration 22: Remove unique contraint on base/name/type in models table - Migration 23: Migrate configs to v6.8.0 schemas - Migration 24: Normalize file storage * fix(mm): pop base/type/format when creating unknown model config * fix(db): migration 22 insert only real cols * fix(db): migration 23 fall back to unknown model when config change fails * feat(db): run migrations 23 and 24 * fix(mm): false negative on flux lora * fix(mm): vae checkpoint probe checking for dir instead of file * fix(mm): ModelOnDisk skips dirs when looking for weights Previously a path w/ any of the 
known weights suffixes would be seen as a weights file, even if it was a directory. We now check to ensure the candidate path is actually a file before adding it to the list of weights. * feat(mm): add method to get main model defaults from a base * feat(mm): do not log when multiple non-unknown model matches * refactor(mm): continued iteration on model identifcation * tests(mm): refactor model identification tests Overhaul of model identification (probing) tests. Previously we didn't test the correctness of probing except in a few narrow cases - now we do. See tests/model_identification/README.md for a detailed overview of the new test setup. It includes instructions for adding a new test case. In brief: - Download the model you want to add as a test case - Run a script against it to generate the test model files - Fill in the expected model type/format/base/etc in the generated test metadata JSON file Included test cases: - All starter models - A handful of other models that I had installed - Models present in the previous test cases as smoke tests, now also tested for correctness * fix(mm): omit type/format/base when creating unknown config instance * feat(mm): use ValueError for model id sanity checks * feat(mm): add flag for updating models to allow class changes * tests(mm): fix remaining MM tests * feat: allow users to edit models freely * feat(ui): add warning for model settings edit * tests(mm): flux state dict tests * tidy: remove unused file * fix(mm): lora state dict loading in model id * feat(ui): use translation string for model edit warning * docs(db): update version numbers in migration comments * chore: bump version to v6.9.0a1 * docs: update model id readme * tests(mm): attempt to fix windows model id tests * fix(mm): issue with deleting single file models * feat(mm): just delete the dir w/ rmtree when deleting model * tests(mm): windows CI issue * fix(ui): typegen schema sync * fix(mm): fixes for migration 23 - Handle CLIP Embed and Main SD 
models missing variant field - Handle errors when calling the discriminator function, previously only handled ValidationError but it could be a ValueError or something else - Better logging for config migration * chore: bump version to v6.9.0a2 * chore: bump version to v6.9.0a3	2025-10-15 10:18:53 +11:00
Kent Keirsey	af58a75e97	Support PEFT Loras with Base_Model.model prefix (#8433) * Support PEFT Loras with Base_Model.model prefix * update tests * ruff * fix python complaints * update kes * format keys * remove unneeded test	2025-08-18 09:14:46 -04:00
psychedelicious	a8a07598c8	chore: ruff	2025-08-18 21:14:00 +10:00
psychedelicious	23206e22e8	tests: skip excessively flaky MPS-specific tests in CI	2025-08-18 21:14:00 +10:00
Heathen711	8cef0f5bf5	Update supported cuda slot input.	2025-06-16 19:33:19 +10:00
Kevin Turner	50cf285efb	fix: group aitoolkit lora layers	2025-06-16 19:08:11 +10:00
Kevin Turner	a214f4fff5	fix: group aitoolkit lora layers	2025-06-16 19:08:11 +10:00
Kevin Turner	2981591c36	test: add some aitoolkit lora tests	2025-06-16 19:08:11 +10:00
Kevin Turner	52a8ad1c18	chore: rename model.size to model.file_size to disambiguate from RAM size or pixel size	2025-04-10 09:53:03 +10:00
Kevin Turner	98260a8efc	test: add size field to test model configs	2025-04-10 09:53:03 +10:00
psychedelicious	aaa6211625	chore(backend): ruff C420	2025-03-28 18:28:32 -04:00
Billy	182580ff69	Imports	2025-03-26 12:55:10 +11:00
Billy	8e9d5c1187	Ruff formatting	2025-03-26 12:30:31 +11:00
Billy	99aac5870e	Remove star imports	2025-03-26 12:27:00 +11:00
Ryan Dick	f1fde792ee	Get FLUX Redux working: model loading and inference.	2025-03-06 10:31:17 +11:00
Ryan Dick	5357d6e08e	Rename ConcatenatedLoRALayer to MergedLayerPatch. And other minor cleanup.	2025-01-28 14:51:35 +00:00
Ryan Dick	28514ba59a	Update ConcatenatedLoRALayer to work with all sub-layer types.	2025-01-28 14:51:35 +00:00
Ryan Dick	206f261e45	Add utils for loading FLUX OneTrainer DoRA models.	2025-01-28 14:51:35 +00:00
Ryan Dick	dfa253e75b	Add utils for working with Kohya LoRA keys.	2025-01-28 14:51:35 +00:00
Ryan Dick	faa4fa02c0	Expand unit tests to test for confusion between FLUX LoRA formats.	2025-01-28 14:51:35 +00:00
Ryan Dick	5bd6428fdd	Add is_state_dict_likely_in_flux_onetrainer_format() util function.	2025-01-28 14:51:35 +00:00
Ryan Dick	8b4f411f7b	Add a test state dict for the OneTrainer DoRA format.	2025-01-28 14:51:35 +00:00
Ryan Dick	e2f05d0800	Add unit tests for LoKR patch layers. The new tests trigger a bug when LoKR layers are applied to BnB-quantized layers (also impacts several other LoRA variant types).	2025-01-22 09:20:40 +11:00
Ryan Dick	36a3869af0	Add keep_ram_copy_of_weights config option.	2025-01-16 15:35:25 +00:00
Ryan Dick	c76d08d1fd	Add keep_ram_copy option to CachedModelOnlyFullLoad.	2025-01-16 15:08:23 +00:00
Ryan Dick	04087c38ce	Add keep_ram_copy option to CachedModelWithPartialLoad.	2025-01-16 14:51:44 +00:00
Ryan Dick	974b4671b1	Deprecate the `ram` and `vram` configs to make the migration to dynamic memory limits smoother for users who had previously overridden these values.	2025-01-07 16:45:29 +00:00
Ryan Dick	d7ab464176	Offload the current model when locking if it is already partially loaded and we have insufficient VRAM.	2025-01-07 02:53:44 +00:00
Ryan Dick	5eafe1ec7a	Fix ModelCache execution device selection in unit tests.	2025-01-07 01:20:15 +00:00
Ryan Dick	a167632f09	Calculate model cache size limits dynamically based on the available RAM / VRAM.	2025-01-07 01:14:20 +00:00
Ryan Dick	402dd840a1	Add seed to flaky unit test.	2025-01-07 00:31:00 +00:00
Ryan Dick	d0bfa019be	Add 'enable_partial_loading' config flag.	2025-01-07 00:31:00 +00:00
Ryan Dick	535e45cedf	First pass at adding partial loading support to the ModelCache.	2025-01-07 00:30:58 +00:00
Ryan Dick	9a0a226ce1	Fix bitsandbytes imports in unit tests on MacOS.	2024-12-30 10:41:48 -05:00
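
Commit ddaa12b0fd in the table above replaces bare `except:` clauses with `except Exception:` and mutable list defaults with `None`. The following is a minimal Python sketch of both pitfalls and their fixes. All function names here are hypothetical illustrations and are not taken from the InvokeAI code.

```python
def append_shared(item, bucket=[]):
    # Pitfall: the default list is created once, at definition time,
    # and is shared by every call that omits `bucket`.
    bucket.append(item)
    return bucket


def append_fresh(item, bucket=None):
    # Fix: use None as the sentinel and build a new list per call.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket


def swallow_everything(action):
    # Pitfall: a bare except also catches KeyboardInterrupt and SystemExit.
    try:
        action()
    except:  # noqa: E722
        return "swallowed"
    return "ok"


def swallow_errors_only(action):
    # Fix: `except Exception` lets KeyboardInterrupt and SystemExit propagate.
    try:
        action()
    except Exception:
        return "swallowed"
    return "ok"


def interrupt():
    # Hypothetical action that simulates the user pressing Ctrl+C.
    raise KeyboardInterrupt
```

With these definitions, repeated calls to `append_shared` keep growing the same list, while `append_fresh` returns a new list each time. Likewise, `swallow_everything(interrupt)` hides the interrupt, whereas `swallow_errors_only(interrupt)` lets it reach the caller.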


244 Commits
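
Commit b7afd9b5b3 describes a `TypeError` that arises when an unconfigured `MagicMock` logger's `getEffectiveLevel()` result is compared with `logging.DEBUG`. The sketch below reproduces that failure mode in isolation using only the standard library. The helper `should_log_cache_state` is a hypothetical stand-in for the level check mentioned in the commit message, not the actual `ModelCache` code.

```python
import logging
from unittest.mock import MagicMock


def should_log_cache_state(logger) -> bool:
    # Hypothetical stand-in: mirrors the comparison quoted in the commit message.
    return logger.getEffectiveLevel() > logging.DEBUG


# Unconfigured mock: getEffectiveLevel() returns another MagicMock,
# and comparing a MagicMock with an int raises TypeError.
unconfigured = MagicMock()
try:
    should_log_cache_state(unconfigured)
    failed = False
except TypeError:
    failed = True

# Configured mock: return a real log level so the comparison is int vs. int.
configured = MagicMock()
configured.getEffectiveLevel.return_value = logging.INFO
result = should_log_cache_state(configured)
```

Setting `return_value` on the mocked method is the general pattern here: any mocked call whose result feeds an ordering comparison needs a concrete value of the expected type.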