KAIST-Visual-AI-Group / GrounDiTLinks
[NeurIPS 2024] Official Implementation of GrounDiT
☆57Updated 11 months ago
Alternatives and similar repositories for GrounDiT
Users that are interested in GrounDiT are comparing it to the libraries listed below
Sorting:
- Learning Motion from Low-Rank Adaptation☆45Updated last year
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆107Updated 11 months ago
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆68Updated last month
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆84Updated last year
- ☆50Updated last month
- ☆93Updated 6 months ago
- Distilling Diversity and Control in Diffusion Models☆45Updated 6 months ago
- Experiencing lightning fast (~1s) and accurate drag-based image editing☆82Updated last year
- ☆65Updated last year
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆80Updated last year
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Updated 9 months ago
- Trying to implement https://arxiv.org/abs/2305.08891☆33Updated 2 years ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆36Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆67Updated 5 months ago
- Official implementation of "Perturbed-Attention Guidance"☆58Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 9 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆70Updated 7 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆103Updated last year
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆85Updated 6 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆15Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆45Updated last year
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆87Updated last year
- [ICLR 2024] Code for FreeNoise based on LaVie☆33Updated last year
- ☆66Updated last year
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆105Updated last year
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆70Updated 11 months ago
- Official PyTorch Implementation for Readout Guidance, CVPR 2024☆150Updated 4 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Updated 11 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated 10 months ago
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Updated 11 months ago