SHI-Labs/IMG-Multimodal-Diffusion-Alignment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SHI-Labs/IMG-Multimodal-Diffusion-Alignment)

SHI-Labs / IMG-Multimodal-Diffusion-Alignment

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025

☆30

Alternatives and similar repositories for IMG-Multimodal-Diffusion-Alignment

Users that are interested in IMG-Multimodal-Diffusion-Alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LeapLabTHU / UniTTA
View on GitHub
☆21Mar 5, 2025Updated last year
star9988rr / VIPScene
View on GitHub
☆37Dec 2, 2025Updated 7 months ago
LeapLabTHU / RvR
View on GitHub
🔥 Regeneration over editing: unlocking more effective image refinement!
☆51May 26, 2026Updated last month
LeapLabTHU / AdaptiveNN-Jittor
View on GitHub
☆33May 27, 2026Updated last month
LeapLabTHU / DAT-Jittor
View on GitHub
Jittor implementation of Vision Transformer with Deformable Attention
☆32Mar 1, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
LeapLabTHU / MOSS
View on GitHub
Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning
☆23Nov 16, 2022Updated 3 years ago
LeapLabTHU / diver-ct
View on GitHub
☆14Dec 19, 2024Updated last year
LeapLabTHU / ENAT
View on GitHub
[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
☆25Nov 28, 2024Updated last year
LeapLabTHU / OVM3D-Det
View on GitHub
☆55Jan 2, 2025Updated last year
LeapLabTHU / FamO2O
View on GitHub
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆41Oct 30, 2023Updated 2 years ago
LeapLabTHU / CheckpointKD
View on GitHub
☆27Oct 6, 2022Updated 3 years ago
LeapLabTHU / AdaAFforPINNs
View on GitHub
☆19Aug 9, 2023Updated 2 years ago
LeapLabTHU / Text4Point
View on GitHub
☆37Jan 18, 2023Updated 3 years ago
LeapLabTHU / Attention-Mediators
View on GitHub
[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
☆47Sep 11, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LeapLabTHU / AdaFocusV2
View on GitHub
[CVPR 2022] Official repository of AdaFocusV2.
☆91Dec 15, 2024Updated last year
LeapLabTHU / SimPro
View on GitHub
[ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning
☆31Sep 30, 2024Updated last year
LeapLabTHU / AdaNAT
View on GitHub
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
☆37Sep 12, 2024Updated last year
LeapLabTHU / UltraHiT
View on GitHub
[ICRA 2026] UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography
☆18Mar 9, 2026Updated 4 months ago
LeapLabTHU / GridMix
View on GitHub
Repository of GridMix (ICLR 2025)
☆36Mar 18, 2025Updated last year
shihao1895 / SpatialActor
View on GitHub
[AAAI 2026 Oral] SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation
☆62Jun 13, 2026Updated last month
LeapLabTHU / Uni-AdaFocus
View on GitHub
Official repository of Uni-AdaFocus (TPAMI 2024).
☆59Dec 17, 2024Updated last year
LeapLabTHU / JustGRPO
View on GitHub
[ICML 2026 Outstanding Paper] Minimalist RL for Diffusion LLMs. 89.1% on GSM8K.
☆244Jul 6, 2026Updated 2 weeks ago
LeapLabTHU / InsightTok
View on GitHub
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation
☆37May 15, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LeapLabTHU / CODA
View on GitHub
CODA: Repurposing Continuous VAEs for Discrete Tokenization
☆37Jul 4, 2025Updated last year
LeapLabTHU / DAT-Detection
View on GitHub
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…
☆21Apr 17, 2024Updated 2 years ago
LeapLabTHU / ImprovedNAT
View on GitHub
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆47Jun 13, 2024Updated 2 years ago
yueyang130 / SEEM
View on GitHub
Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
☆24Oct 30, 2023Updated 2 years ago
LeapLabTHU / Rank-DETR
View on GitHub
[NeurIPS 2023] Rank-DETR for High Quality Object Detection
☆106Oct 19, 2023Updated 2 years ago
LeapLabTHU / CheXWorld
View on GitHub
[CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
☆48Apr 21, 2025Updated last year
LeapLabTHU / InLine
View on GitHub
[NeurIPS 2024] Official repository of InLine attention
☆61Dec 22, 2024Updated last year
LeapLabTHU / DAT-Segmentation
View on GitHub
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…
☆26Sep 7, 2023Updated 2 years ago
LeapLabTHU / Deep-Incubation
View on GitHub
Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)
☆92Mar 16, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated 2 years ago
LeapLabTHU / Segment3D
View on GitHub
☆98Dec 29, 2024Updated last year
LeapLabTHU / ARC
View on GitHub
[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection
☆145Mar 15, 2025Updated last year
alibaba-damo-academy / T2I-Distill
View on GitHub
[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide
☆368Dec 31, 2025Updated 6 months ago
LeapLabTHU / EfficientTrain
View on GitHub
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…
☆231Aug 23, 2024Updated last year
LeapLabTHU / DAPrompt
View on GitHub
Pytorch implementation of DAPrompt: https://arxiv.org/abs/2202.06687
☆99Feb 12, 2023Updated 3 years ago
LeapLabTHU / Cross-Modal-Adapter
View on GitHub
[Pattern Recognition 2025] Cross-Modal Adapter for Vision-Language Retrieval
☆143Aug 17, 2025Updated 11 months ago