The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation".
☆47Apr 10, 2025Updated 11 months ago
Alternatives and similar repositories for DyDiT
Users that are interested in DyDiT are comparing it to the libraries listed below
Sorting:
- ☆92Mar 26, 2025Updated 11 months ago
- Implementation of the Mesh-VQVAE of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆17Oct 30, 2024Updated last year
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆33Oct 15, 2025Updated 4 months ago
- Implementation of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆13Mar 24, 2025Updated 11 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆25Aug 5, 2025Updated 7 months ago
- List of diffusion papers accepted in ECCV 2024.☆15Oct 17, 2024Updated last year
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 5 months ago
- ☆45Dec 6, 2025Updated 3 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- ☆43Nov 22, 2023Updated 2 years ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆82May 22, 2025Updated 9 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- ☆20Sep 19, 2023Updated 2 years ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- ☆28Mar 4, 2025Updated last year
- ☆18Oct 23, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated 2 months ago
- ICML2025☆63Aug 28, 2025Updated 6 months ago
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆24Feb 19, 2024Updated 2 years ago
- ☆24Nov 1, 2024Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …☆20Dec 15, 2024Updated last year
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 7 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆47Jul 5, 2025Updated 8 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 10 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Oct 2, 2024Updated last year
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆119Dec 17, 2025Updated 2 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆95Nov 30, 2025Updated 3 months ago
- Official Implementations for Paper - MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues☆128Dec 3, 2025Updated 3 months ago
- CODA: Repurposing Continuous VAEs for Discrete Tokenization☆35Jul 4, 2025Updated 8 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated last year
- [T-PAMI2025] Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy☆28Jan 13, 2025Updated last year
- [ACCV 2024] Official PyTorch implementation of "Diffusion Model Compression for Image-to-Image Translation"☆22Aug 31, 2025Updated 6 months ago
- Projection-augmentation embedding for CLIP-based latent manipulation methods☆25Feb 2, 2026Updated last month
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆56Feb 2, 2026Updated last month