Unlocking Iterative Reasoning for Any Image Editor
☆99Jan 18, 2026Updated 2 months ago
Alternatives and similar repositories for EditThinker
Users who are interested in EditThinker are comparing it to the libraries listed below.
- ☆49Feb 9, 2026Updated last month
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆27Aug 7, 2025Updated 7 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆53Mar 5, 2026Updated 2 weeks ago
- (CVPR 2026) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject composition …☆28Jan 14, 2026Updated 2 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆214Mar 11, 2026Updated last week
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models☆177Jan 4, 2026Updated 2 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- ☆53Dec 10, 2025Updated 3 months ago
- ComfyUI version of WithAnyone☆24Dec 18, 2025Updated 3 months ago
- Spatial Aptitude Training for Multimodal Language Models☆25Feb 8, 2026Updated last month
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆30Feb 28, 2026Updated 3 weeks ago
- The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"☆20Jun 20, 2025Updated 9 months ago
- maze datasets for investigating OOD behavior of ML systems☆74Jan 19, 2026Updated 2 months ago
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆41Jul 8, 2025Updated 8 months ago
- Reimplementation of D4RT☆38Dec 26, 2025Updated 2 months ago
- This is the official repository for "BokehDiff: Neural Lens Blur with One-Step Diffusion" (ICCV'25).☆46Sep 12, 2025Updated 6 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆54Jan 7, 2026Updated 2 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 2 months ago
- ☆14Jun 2, 2025Updated 9 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆13Jan 22, 2025Updated last year
- ☆49Oct 6, 2024Updated last year
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- ☆22Feb 13, 2026Updated last month
- Investigating and Defending Shortcut Learning in Personalized Diffusion Models☆13Nov 19, 2024Updated last year
- ☆17May 10, 2023Updated 2 years ago
- ☆28Aug 14, 2024Updated last year
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"☆70Dec 2, 2025Updated 3 months ago
- ☆11Jun 3, 2023Updated 2 years ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆30Dec 31, 2025Updated 2 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆31Jan 6, 2026Updated 2 months ago
- ☆12Apr 26, 2024Updated last year
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆33Dec 27, 2025Updated 2 months ago
- The official paper summary of TMLR'25 paper "Survey of Video Diffusion Models: Foundations, Implementations, and Applications"☆38Feb 2, 2026Updated last month
- Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization (ACM MM 2024)☆18Mar 31, 2025Updated 11 months ago