VDT-2023 / VDTLinks
β10Updated 2 years ago
Alternatives and similar repositories for VDT
Users that are interested in VDT are comparing it to the libraries listed below
Sorting:
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'β21Updated 8 months ago
- πPytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"β27Updated 8 months ago
- Code for CVPR'2022 paper β¨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Lβ¦β37Updated 3 years ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"β32Updated last year
- β48Updated 3 months ago
- β65Updated last year
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decompositionβ37Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generationβ80Updated last year
- β21Updated 2 years ago
- Official implementation of Auroraβ83Updated last year
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformβ¦β29Updated 2 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"β27Updated last year
- β21Updated last year
- Motion-conditional image animation for video editingβ20Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Modelsβ46Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generationβ30Updated 7 months ago
- β25Updated 7 months ago
- ElasticTok: Adaptive Tokenization for Image and Videoβ70Updated 7 months ago
- DREAM: Diffusion Rectification and Estimation-Adaptive Models (CVPR 2024)β40Updated 4 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntaxβ18Updated last year
- β15Updated last month
- β17Updated 10 months ago
- β23Updated last year
- 3D-Aware Video Generationβ76Updated 2 years ago
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".β50Updated last year
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)β78Updated last year
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generationβ30Updated last month
- β64Updated 11 months ago
- [ICLR 2024] Code for FreeNoise based on LaVieβ34Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utilityβ53Updated 5 months ago