ShoufaChen / Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
โ126Updated 10 months ago
Alternatives and similar repositories for Awesome-Diffusion-Transformers:
Users that are interested in Awesome-Diffusion-Transformers are comparing it to the libraries listed below
- [ICLR 2025] Autoregressive Video Generation without Vector Quantizationโ324Updated last week
- Scaling Diffusion Transformers with Mixture of Expertsโ245Updated 4 months ago
- ๐ฅ Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".โ234Updated last month
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsโ259Updated last month
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Modelโ395Updated 2 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsโ134Updated 7 months ago
- Scaling RWKV-Like Architectures for Diffusion Modelsโ122Updated 9 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Needโ165Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Trainingโ165Updated this week
- ๐ This is a repository for organizing papers, codes and other resources related to unified multimodal models.โ342Updated last week
- XQ-GAN๐: An Open-source Image Tokenization Framework for Autoregressive Generationโ182Updated last week
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generationโ212Updated last week
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"โ178Updated 4 months ago
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxiโฆโ226Updated 8 months ago
- ๆฉๆฃๆจกๅ็ฎๆณๅบ็กๆๆกฃใ่ฎญ็ปใๅฎ้ชใ้จ็ฝฒ็ญไปๅบโ36Updated this week
- โ100Updated 7 months ago
- Towards training VQ-VAE models robustly!โ43Updated 2 weeks ago
- This is a repo to track the latest autoregressive visual generation papers.โ119Updated this week
- ๐ฅstable, simple, state-of-the-art VQVAE toolkit & cookbookโ74Updated 7 months ago
- ๐ฅ [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)โ164Updated 9 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generationโ227Updated this week
- Scalable Diffusion Models with State Space Backboneโ150Updated 10 months ago
- A list for Text-to-Video, Image-to-Video worksโ218Updated last month
- โ112Updated 7 months ago
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Modelโ113Updated 2 weeks ago
- โ131Updated last month
- ๐ Collection of awesome generation acceleration resources.โ112Updated this week
- A light-weight and high-efficient training framework for accelerating diffusion tasks.โ45Updated 4 months ago
- [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.โ271Updated 6 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flowsโ50Updated 2 weeks ago