alibaba-damo-academy / DyDiTView external linksLinks
The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation".
☆48Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for DyDiT
Users that are interested in DyDiT are comparing it to the libraries listed below
Sorting:
- ☆92Mar 26, 2025Updated 10 months ago
- Implementation of the Mesh-VQVAE of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆17Oct 30, 2024Updated last year
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆31Oct 15, 2025Updated 4 months ago
- Implementation of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆13Mar 24, 2025Updated 10 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- List of diffusion papers accepted in ECCV 2024.☆15Oct 17, 2024Updated last year
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆25Aug 5, 2025Updated 6 months ago
- ☆40Dec 6, 2025Updated 2 months ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 4 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆78May 22, 2025Updated 8 months ago
- ☆43Nov 22, 2023Updated 2 years ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- ☆20Sep 19, 2023Updated 2 years ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆23Mar 13, 2025Updated 11 months ago
- ☆18Oct 23, 2024Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- ☆28Mar 4, 2025Updated 11 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- ICML2025☆63Aug 28, 2025Updated 5 months ago
- ☆24Nov 1, 2024Updated last year
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆24Feb 19, 2024Updated last year
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆46Jul 5, 2025Updated 7 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆164Jan 31, 2025Updated last year
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …☆20Dec 15, 2024Updated last year
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆40Jul 23, 2025Updated 6 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆170Feb 18, 2025Updated 11 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆65May 7, 2025Updated 9 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆113Dec 17, 2025Updated 2 months ago
- Official Implementations for Paper - MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues☆120Dec 3, 2025Updated 2 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆93Nov 30, 2025Updated 2 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆54Feb 2, 2026Updated 2 weeks ago
- CODA: Repurposing Continuous VAEs for Discrete Tokenization☆35Jul 4, 2025Updated 7 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated 11 months ago
- [T-PAMI2025] Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy☆28Jan 13, 2025Updated last year
- Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss”☆175Feb 3, 2026Updated 2 weeks ago
- Projection-augmentation embedding for CLIP-based latent manipulation methods☆25Feb 2, 2026Updated 2 weeks ago
- [ACM Multimedia 2024] Shape-Guided Clothing Warping for Virtual Try-On☆29May 14, 2025Updated 9 months ago