Unlocking Iterative Reasoning for Any Image Editor
☆107Jan 18, 2026Updated 4 months ago
Alternatives and similar repositories for EditThinker
Users that are interested in EditThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆54Feb 9, 2026Updated 3 months ago
- RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models☆47Mar 29, 2026Updated last month
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆28Aug 7, 2025Updated 9 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 5 months ago
- ☆34Feb 24, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆37Dec 30, 2025Updated 4 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 11 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆227Apr 14, 2026Updated last month
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆31Apr 9, 2026Updated last month
- ThinkGen: Generalized Thinking for Visual Generation☆53Dec 30, 2025Updated 4 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆64Mar 17, 2026Updated 2 months ago
- ☆55May 6, 2025Updated last year
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- ☆53Dec 10, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于unity视频播放器,暂时只支持android版本(IOS版可以基于这个原理开发),使用的是Opengl渲染!支持在线视频播放,本地视频播放,三种模式(普通模式,左右模式,上下模式);附带上android导出jar包给uinty使用,NDK开发经验c++.研究需要懂un…☆11Apr 12, 2017Updated 9 years ago
- ComfyUI custom node that adds a quick and visual UI selector for building prompts to the sidebar.☆13Sep 4, 2024Updated last year
- 🎉 TrustJudge is accepted to ICLR 2026!☆46Sep 27, 2025Updated 7 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆43Apr 12, 2026Updated last month
- Spatial Aptitude Training for Multimodal Langauge Models☆31Feb 8, 2026Updated 3 months ago
- GEditBench v2: A Human-Aligned Benchmark for General Image Editing☆53Apr 1, 2026Updated last month
- The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"☆19Jun 20, 2025Updated 11 months ago
- ☆22Aug 17, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Neuromorphic vision papers and where to find them! -- A visualization of all institutes in the world that do neuromorphic vision research…☆18Mar 31, 2026Updated last month
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆45Jul 8, 2025Updated 10 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 9 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆56Mar 31, 2026Updated last month
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆15Jul 31, 2025Updated 9 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆13Jan 22, 2025Updated last year
- This is the official repository for "BokehDiff: Neural Lens Blur with One-Step Diffusion" (ICCV'25).☆49Apr 27, 2026Updated 3 weeks ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 3 months ago
- Investigating and Defending Shortcut Learning in Personalized Diffusion Models☆14Nov 19, 2024Updated last year
- ☆28Aug 14, 2024Updated last year
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆466Sep 24, 2025Updated 7 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆13Apr 11, 2025Updated last year
- Official Code for CVPR 2024 paper: Permutation Equivariance of Transformers and Its Applications.☆16Nov 12, 2024Updated last year
- [CVPR 2025] Test-Time Visual In-Context Tuning☆30Dec 31, 2025Updated 4 months ago