[NeurIPS 2024] Official Implementation of GrounDiT
☆59Dec 12, 2024Updated last year
Alternatives and similar repositories for GrounDiT
Users that are interested in GrounDiT are comparing it to the libraries listed below.
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated last month
- ☆24Nov 29, 2023Updated 2 years ago
- [NeurIPS 2023] Official implementation of SyncDiffusion☆168Apr 20, 2024Updated last year
- An open source Multi-View Latent Diffusion Model☆42Feb 23, 2026Updated last month
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆32Oct 17, 2025Updated 5 months ago
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Oct 12, 2025Updated 5 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 9 months ago
- Official Implementation of Posterior Distillation Sampling☆93Jul 7, 2025Updated 8 months ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- [CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"☆28May 27, 2025Updated 9 months ago
- ☆13Oct 14, 2024Updated last year
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆88Sep 3, 2024Updated last year
- InstantDrag: Improving Interactivity in Drag-based Image Editing☆236Oct 14, 2024Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆77Jun 7, 2024Updated last year
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆56Nov 7, 2024Updated last year
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- ☆20Jun 26, 2024Updated last year
- Replicate Cog'ified MMAudio☆18Apr 2, 2025Updated 11 months ago
- Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]☆19Dec 9, 2024Updated last year
- Text and image to video generation: Kandinsky 4.0 (2024)☆150Dec 17, 2024Updated last year
- ☆25Mar 30, 2025Updated 11 months ago
- ☆11Sep 28, 2024Updated last year
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]☆140Apr 16, 2025Updated 11 months ago
- Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)☆70Aug 4, 2024Updated last year
- Official implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (Neu…☆136Oct 3, 2024Updated last year
- Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling☆13Oct 5, 2024Updated last year
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆81Feb 22, 2024Updated 2 years ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆62Jan 22, 2025Updated last year
- This repo contains the python code as well as the webpage html files for the Spice-E project from VAILab at TAU.☆27Dec 9, 2024Updated last year
- [ICCV 2025] DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models (official implementation)☆154May 21, 2025Updated 10 months ago
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆61Jun 27, 2025Updated 8 months ago
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- This repository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Nov 27, 2024Updated last year
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated last year
- ☆46Nov 20, 2025Updated 4 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- ☆34Dec 29, 2025Updated 2 months ago