* global bool
* reworked circular to global flag
* cleaner implementation of tiling support in sd cpp
* cleaned rope
* working simplified but still need wraps
* Further clean of rope
* resolve flux conflict
* switch to pad op circular only
* Set ggml to most recent
* Revert ggml temp
* Update ggml to most recent
* Revert unneded flux change
* move circular flag to the GGMLRunnerContext
* Pass through circular param in all places where conv is called
* fix of constant and minor cleanup
* Added back --circular option
* Conv2d circular in vae and various models
* Fix temporal padding for qwen image and other vaes
* Z Image circular tiling
* x and y axis seamless only
* First attempt at chroma seamless x and y
* refactor into pure x and y, almost there
* Fix crash on chroma
* Refactor into cleaner variable choices
* Removed redundant set_circular_enabled
* Sync ggml
* simplify circular parameter
* format code
* no need to perform circular pad on the clip
* simplify circular_axes setting
* unify function naming
* remove unnecessary member variables
* simplify rope
---------
Co-authored-by: Phylliida <phylliidadev@gmail.com>
Co-authored-by: leejet <leejet714@gmail.com>
* Add x0 Flux pred (+prepare for others)
* Fix convert models with empty tensors
* patch_32 exp support attempt
* improve support for patch_32
* follow official pipeline
---------
Co-authored-by: leejet <leejet714@gmail.com>
* feat: default to LCM scheduler for LCM sampling
* fix bug and attempt to get default scheduler for vid_gen when none is set
---------
Co-authored-by: leejet <leejet714@gmail.com>
* Fix syntax for CUDA architecture definitions
* Extend CUDA support to GTX 10 Series to RTX 50 Series
* update cuda installer step version to install cuda 12.8.1
* Remove unsupported compute capability