InvokeAI/invokeai/backend/stable_diffusion
Alexander Eichhorn eb3f1c9a61 feat: Add Z-Image-Turbo model support
Add comprehensive support for Z-Image-Turbo (S3-DiT) models including:

Backend:
- New BaseModelType.ZImage in taxonomy
- Z-Image model config classes (ZImageTransformerConfig, Qwen3TextEncoderConfig)
- Model loader for Z-Image transformer and Qwen3 text encoder
- Z-Image conditioning data structures
- Step callback support for Z-Image with FLUX latent RGB factors

Invocations:
- z_image_model_loader: Load Z-Image transformer and Qwen3 encoder
- z_image_text_encoder: Encode prompts using Qwen3 with chat template
- z_image_denoise: Flow matching denoising with time-shifted sigmas
- z_image_image_to_latents: Encode images to 16-channel latents
- z_image_latents_to_image: Decode latents using FLUX VAE

Frontend:
- Z-Image graph builder for text-to-image generation
- Model picker and validation updates for z-image base type
- CFG scale now allows 0 (required for Z-Image-Turbo)
- Clip skip disabled for Z-Image (uses Qwen3, not CLIP)
- Optimal dimension settings for Z-Image (1024x1024)

Technical details:
- Uses Qwen3 text encoder (not CLIP/T5)
- 16 latent channels with FLUX-compatible VAE
- Flow matching scheduler with dynamic time shift
- 8 inference steps recommended for Turbo variant
- bfloat16 inference dtype
2025-12-01 00:22:32 +01:00
..
diffusion feat: Add Z-Image-Turbo model support 2025-12-01 00:22:32 +01:00
extensions Imports 2025-03-26 12:55:10 +11:00
schedulers Ruff format 2024-07-29 22:59:17 +03:00
__init__.py Remove old seamless class 2024-07-23 18:04:33 +03:00
denoise_context.py Suggested changes 2024-07-22 22:17:29 +03:00
diffusers_pipeline.py ruff fixes 2025-05-19 13:50:04 +10:00
diffusion_backend.py Merge branch 'main' into stalker-modular_freeu 2024-07-23 09:53:32 -04:00
extension_callback_type.py Add rescale cfg support to denoise 2024-07-21 17:33:43 +03:00
extensions_manager.py Optimize weights handling 2024-07-30 03:39:01 +03:00
multi_diffusion_pipeline.py fix(app): step callbacks for SD, FLUX, MultiDiffusion 2024-09-22 21:20:32 +03:00
vae_tiling.py Expose the VAE tile_size on the VAE encode and decode invocations. 2024-07-02 09:07:03 -04:00