KAIST-Visual-AI-Group / GrounDiTLinks
Official Implementation of GrounDiT (NeurIPS 2024)
☆54Updated 7 months ago
Alternatives and similar repositories for GrounDiT
Users that are interested in GrounDiT are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆103Updated 8 months ago
- Learning Motion from Low-Rank Adaptation☆45Updated last year
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆61Updated last month
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆83Updated last year
- Distilling Diversity and Control in Diffusion Models☆44Updated 3 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆40Updated 11 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆64Updated 2 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆27Updated 3 weeks ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆15Updated 11 months ago
- Official implementation of "Perturbed-Attention Guidance"☆57Updated last year
- [ECCV2024] PartCraft: Crafting Creative Objects by Parts☆93Updated 6 months ago
- Trying to implement https://arxiv.org/abs/2305.08891☆34Updated 2 years ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆79Updated 2 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 6 months ago
- ☆45Updated last week
- Experiencing lightning fast (~1s) and accurate drag-based image editing☆77Updated 9 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆87Updated 11 months ago
- ☆67Updated last year
- ☆23Updated 9 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆80Updated 4 months ago
- ☆64Updated last year
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆103Updated last year
- ☆30Updated 4 months ago
- ☆96Updated 3 months ago
- [ICLR 2024] Code for FreeNoise based on LaVie☆34Updated last year
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Updated 7 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆60Updated 4 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆110Updated 8 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Updated 9 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆70Updated 7 months ago