KAIST-Visual-AI-Group/GrounDiT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KAIST-Visual-AI-Group/GrounDiT)

KAIST-Visual-AI-Group / GrounDiT

[NeurIPS 2024] Official Implementation of GrounDiT

☆59

Alternatives and similar repositories for GrounDiT

Users that are interested in GrounDiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KAIST-Visual-AI-Group / Psi-Sampler
View on GitHub
[NeurIPS 2025, Spotlight] Official code for Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score-Based Genera…
☆18Feb 3, 2026Updated 5 months ago
StevenShaw1999 / RnB
View on GitHub
☆24Nov 29, 2023Updated 2 years ago
KAIST-Visual-AI-Group / VG-AVS
View on GitHub
Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
☆24Feb 5, 2026Updated 5 months ago
KAIST-Visual-AI-Group / SyncDiffusion
View on GitHub
[NeurIPS 2023] Official implementation of SyncDiffusion
☆169Apr 20, 2024Updated 2 years ago
KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
View on GitHub
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆75Oct 12, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
KAIST-Visual-AI-Group / APC-VLM
View on GitHub
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆66Sep 12, 2025Updated 10 months ago
KAIST-Visual-AI-Group / ORIGEN
View on GitHub
[NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
☆32Oct 17, 2025Updated 9 months ago
KAIST-Visual-AI-Group / BezierFlow
View on GitHub
[ICLR 2026] Official code for BézierFlow: Learning Bézier Stochastic Interpolant Schedulers for Few-Step Generation
☆21Apr 13, 2026Updated 3 months ago
KAIST-Visual-AI-Group / PDS
View on GitHub
Official Implementation of Posterior Distillation Sampling
☆94Jul 7, 2025Updated last year
DoHunLee1 / VideoGuide
View on GitHub
[CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"
☆30May 27, 2025Updated last year
camenduru / TANGO-jupyter
View on GitHub
☆13Oct 14, 2024Updated last year
mohammadasim98 / mv-ldm
View on GitHub
An open source Multi-View Latent Diffusion Model
☆44Feb 23, 2026Updated 5 months ago
xiefan-guo / initno
View on GitHub
[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
☆80Jun 7, 2024Updated 2 years ago
yangqy1110 / NC-SDEdit
View on GitHub
[ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
☆89Sep 3, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SNU-VGILab / InstantDrag
View on GitHub
InstantDrag: Improving Interactivity in Drag-based Image Editing
☆237May 28, 2026Updated last month
yangxiaofeng / rectified_flow_prior
View on GitHub
Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]
☆142Apr 16, 2025Updated last year
camenduru / AdvancedLivePortrait-jupyter
View on GitHub
☆11Sep 28, 2024Updated last year
18445864529 / MAVIN
View on GitHub
Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling
☆13Oct 5, 2024Updated last year
KwonGihyun / TweedieMix
View on GitHub
Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)
☆62Jan 22, 2025Updated last year
jianzongwu / MotionBooth
View on GitHub
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138Oct 8, 2024Updated last year
KAIST-Visual-AI-Group / StochSync
View on GitHub
Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…
☆21Jun 24, 2025Updated last year
hohonu-vicml / DirectedDiffusion
View on GitHub
Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)
☆82Feb 22, 2024Updated 2 years ago
ai-forever / Kandinsky-4
View on GitHub
Text and image to video generation: Kandinsky 4.0 (2024)
☆150Dec 17, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
limuloo / DreamRenderer
View on GitHub
[ICCV 2025] DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models (official implement)
☆156May 21, 2025Updated last year
WUyinwei-hah / IFAdapter
View on GitHub
[ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".
☆62Jun 27, 2025Updated last year
byeongjun-park / SteerX
View on GitHub
[ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"
☆51Mar 20, 2025Updated last year
showlab / EvolveDirector
View on GitHub
[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
☆52Oct 14, 2024Updated last year
SusungHong / SEG-SDXL
View on GitHub
Official implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (Neu…
☆138Oct 3, 2024Updated last year
KAIST-Visual-AI-Group / SyncTweedies
View on GitHub
Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)
☆69Aug 4, 2024Updated last year
ExplainableML / ImageSelect
View on GitHub
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Jul 10, 2023Updated 3 years ago
wufeim / imagenet3d
View on GitHub
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
☆21Dec 6, 2024Updated last year
LAION-AI / conditioned-prior
View on GitHub
(wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.
☆29Jul 14, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KAIST-Visual-AI-Group / Token-Warping-MLLM
View on GitHub
☆22Mar 31, 2026Updated 3 months ago
camenduru / FluxMusic-jupyter
View on GitHub
☆18Sep 4, 2024Updated last year
snap-research / SF-V
View on GitHub
This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.
☆99Nov 27, 2024Updated last year
SHI-Labs / Smooth-Diffusion
View on GitHub
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024
☆355Sep 24, 2024Updated last year
paintscene4d / paintscene4d.github.io
View on GitHub
☆25Mar 30, 2025Updated last year
yahoojapan / srgd
View on GitHub
Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]
☆19Dec 9, 2024Updated last year
Ground-A-Score / Ground-A-Score
View on GitHub
Official PyTorch implementation of "Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing"
☆11Apr 4, 2024Updated 2 years ago