ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
☆192Sep 7, 2025Updated 9 months ago
Alternatives and similar repositories for ThinkDiff
Users that are interested in ThinkDiff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis☆34Dec 27, 2023Updated 2 years ago
- ☆13Jun 4, 2025Updated last year
- [NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis☆76Dec 17, 2025Updated 5 months ago
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆65Jun 6, 2025Updated last year
- [ICLR 2026] Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos☆28May 29, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)☆13Mar 27, 2023Updated 3 years ago
- ☆11Jul 17, 2024Updated last year
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …☆164Mar 16, 2026Updated 2 months ago
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"☆54Oct 24, 2024Updated last year
- Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection☆129Apr 14, 2026Updated last month
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆122Mar 4, 2025Updated last year
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆63Aug 23, 2024Updated last year
- Official code for ECCV2024 paper: GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal☆104Nov 25, 2025Updated 6 months ago
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆135Sep 18, 2025Updated 8 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- Codes for Switch-NeRF (ICLR 2023)☆211Aug 25, 2025Updated 9 months ago
- Codes for GBi-Net (CVPR2022)☆129Jul 20, 2023Updated 2 years ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆60Mar 25, 2024Updated 2 years ago
- ☆34Aug 26, 2025Updated 9 months ago
- ☆14Aug 5, 2024Updated last year
- ☆27Apr 11, 2023Updated 3 years ago
- LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation☆38Mar 3, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec…☆221May 28, 2026Updated last week
- [ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆83May 26, 2026Updated 2 weeks ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆145Jan 27, 2025Updated last year
- Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.☆57Nov 13, 2023Updated 2 years ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆281Jan 7, 2026Updated 5 months ago
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆32Sep 3, 2024Updated last year
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆244Aug 15, 2025Updated 9 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆46Apr 21, 2024Updated 2 years ago
- ☆20May 26, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆30May 7, 2025Updated last year
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆40Dec 30, 2025Updated 5 months ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆270Apr 15, 2025Updated last year
- [ICLR 2026] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-…☆68Apr 10, 2026Updated 2 months ago
- ☆50Apr 1, 2023Updated 3 years ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆315Sep 28, 2025Updated 8 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 10 months ago