Scalable Diffusion Models with State Space Backbone
☆157Mar 7, 2024Updated last year
Alternatives and similar repositories for DiS
Users that are interested in DiS are comparing it to the libraries listed below
Sorting:
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)☆343Mar 17, 2025Updated 11 months ago
- Scaling RWKV-Like Architectures for Diffusion Models☆143Apr 12, 2024Updated last year
- Transformer-Mamba Diffusion Models☆120Jun 30, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- ☆48Mar 12, 2025Updated 11 months ago
- Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…☆27Jun 24, 2024Updated last year
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆107Nov 20, 2024Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆431Nov 10, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 4 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆82Jul 6, 2025Updated 7 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated 11 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers☆421May 15, 2024Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Jun 26, 2024Updated last year
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,096Mar 25, 2023Updated 2 years ago
- [CVPR 2023] GLeaD: Improving GANs with A Generator-Leading Task☆32Jun 5, 2023Updated 2 years ago
- Official implementation of MTM☆21Aug 30, 2023Updated 2 years ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆98Mar 18, 2024Updated last year
- ☆48Mar 31, 2024Updated last year
- More dimensions = More fun☆26Jul 27, 2024Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Nov 29, 2024Updated last year
- My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf☆94Dec 2, 2024Updated last year
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,096Dec 22, 2025Updated 2 months ago
- Consistency Models Made Easy☆325Oct 13, 2024Updated last year
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆507Oct 31, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆996Nov 25, 2025Updated 3 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆230Jun 28, 2024Updated last year
- ☆643May 24, 2024Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year
- ☆57Mar 29, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated 11 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆108Jan 2, 2026Updated 2 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆62Jan 22, 2025Updated last year
- TC4D: Trajectory-Conditioned Text-to-4D Generation☆204Oct 15, 2024Updated last year
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,805Feb 13, 2025Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆46Nov 2, 2023Updated 2 years ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆143Jan 13, 2025Updated last year