MiZhenxing/ThinkDiff

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MiZhenxing/ThinkDiff)

MiZhenxing / ThinkDiff

ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

☆191

Alternatives and similar repositories for ThinkDiff

Users that are interested in ThinkDiff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

W-Ted / UDC-NeRF
View on GitHub
Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
☆34Dec 27, 2023Updated 2 years ago
xyq7 / Human-Contribution-Measurement
View on GitHub
☆13Jun 4, 2025Updated last year
LiuJF1226 / Mono4DGS-HDR
View on GitHub
[ICLR 2026] Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos
☆29May 29, 2026Updated last month
ShaelynZ / synergize-motion-appearance
View on GitHub
[CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"
☆65Jul 3, 2026Updated 2 weeks ago
wzpscott / hybrid-radiance-fields
View on GitHub
[NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis
☆75Dec 17, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MiZhenxing / alpha_visualizer
View on GitHub
Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)
☆13Mar 27, 2023Updated 3 years ago
stevejaehyeok / MoCo-NeRF
View on GitHub
☆11Jul 17, 2024Updated 2 years ago
yangcaoai / 3DGS-DET
View on GitHub
Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …
☆165Mar 16, 2026Updated 4 months ago
MiZhenxing / One4D
View on GitHub
[ECCV 2026] One4D: Unified 4D Generation and Reconstruction
☆115Jun 18, 2026Updated last month
rt219 / LatentGuard
View on GitHub
This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"
☆54Oct 24, 2024Updated last year
yangcaoai / VGGT-Det-CVPR2026
View on GitHub
Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection
☆144Updated this week
chenllliang / DreamEngine
View on GitHub
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123Mar 4, 2025Updated last year
sterzhang / PVIT
View on GitHub
Official Repository of Personalized Visual Instruct Tuning
☆34Mar 6, 2025Updated last year
pipilurj / bootstrapped-preference-optimization-BPO
View on GitHub
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆63Aug 23, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
W-Ted / GScream
View on GitHub
Official code for ECCV2024 paper: GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal
☆104Nov 25, 2025Updated 7 months ago
zhongyingji / guidedvd-3dgs
View on GitHub
Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)
☆138Sep 18, 2025Updated 10 months ago
jianzongwu / MotionBooth
View on GitHub
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138Oct 8, 2024Updated last year
MiZhenxing / Switch-NeRF
View on GitHub
Codes for Switch-NeRF (ICLR 2023)
☆211Aug 25, 2025Updated 10 months ago
MiZhenxing / GBi-Net
View on GitHub
Codes for GBi-Net (CVPR2022)
☆129Jul 20, 2023Updated 3 years ago
prismformore / DiffusionMTL
View on GitHub
Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data
☆60Mar 25, 2024Updated 2 years ago
wzpscott / EvINR
View on GitHub
☆15Aug 5, 2024Updated last year
pipilurj / ROBOT
View on GitHub
☆27Apr 11, 2023Updated 3 years ago
zrealli / LDGen
View on GitHub
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
☆38Mar 3, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Visualignment / SafetyDPO
View on GitHub
☆34Aug 26, 2025Updated 10 months ago
yangcaoai / CoDA_NeurIPS2023
View on GitHub
Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec…
☆222May 28, 2026Updated last month
yjw1029 / Self-Reminder
View on GitHub
Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.
☆57Nov 13, 2023Updated 2 years ago
TIGER-AI-Lab / OmniEdit
View on GitHub
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆144Jan 27, 2025Updated last year
JIA-Lab-research / DreamOmni3
View on GitHub
This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''
☆40Dec 30, 2025Updated 6 months ago
qwang666 / RoomTex-
View on GitHub
[ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting
☆32Sep 3, 2024Updated last year
lzyhha / VisualCloze
View on GitHub
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…
☆283Jan 7, 2026Updated 6 months ago
pipilurj / MLLM-protector
View on GitHub
The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"
☆46Apr 21, 2024Updated 2 years ago
prismformore / SDSEN
View on GitHub
☆20May 26, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wyhlovecpp / GPT-Image-Edit
View on GitHub
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243Aug 15, 2025Updated 11 months ago
VARGPT-family / VARGPT-v1.1
View on GitHub
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
☆271Apr 15, 2025Updated last year
feizc / Video-In-Context
View on GitHub
Video Diffusion Transformers are In-Context Learners
☆37Jan 6, 2025Updated last year
guozinan126 / MUSAR
View on GitHub
☆30May 7, 2025Updated last year
yanchi-3dv / PG-Occ
View on GitHub
[ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocab…
☆34Feb 19, 2026Updated 5 months ago
pipilurj / DynaFed
View on GitHub
☆50Apr 1, 2023Updated 3 years ago
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 9 months ago