lodestone-rock / flowLinks
☆164Updated last week
Alternatives and similar repositories for flow
Users that are interested in flow are comparing it to the libraries listed below
Sorting:
- Official implementation of "Normalized Attention Guidance"☆175Updated 5 months ago
- ☆112Updated 8 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆194Updated 8 months ago
- Multimodal captioner☆201Updated last week
- ☆91Updated 5 months ago
- Various training scripts used to train bigasp☆111Updated 4 months ago
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral)☆126Updated last year
- ☆230Updated 7 months ago
- The best OSS video generation models☆135Updated last year
- ☆79Updated 9 months ago
- ☆96Updated last month
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …☆81Updated last year
- A detailed diagram laying out the full Flux.1 [dev] architecture as shared by Black Forest Labs at https://github.com/black-forest-labs/f…☆81Updated last year
- IP Adapter Instruct☆211Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion☆145Updated 8 months ago
- 🔬 Visualize attention layers from Stable Diffusion☆92Updated 8 months ago
- ☆73Updated 7 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆247Updated last month
- Tiny AutoEncoder for Hunyuan Video (and other video models)☆255Updated last week
- An inference and training framework for multiple image input in Flux Kontext dev☆426Updated 3 months ago
- Keyframe Interpolation with CogvideoX☆137Updated last year
- Scale-wise Distillation of Diffusion Models☆113Updated 3 months ago
- musubi-tuner modified to tune image2video/video infilling☆33Updated 11 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated last year
- A comprehensive codebase for training and finetuning Image <> Latent models.☆48Updated 9 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆188Updated 11 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆270Updated 6 months ago
- ☆160Updated 10 months ago
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!☆47Updated 6 months ago
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆176Updated 3 months ago