Unlocking Iterative Reasoning for Any Image Editor
☆89Jan 18, 2026Updated last month
Alternatives and similar repositories for EditThinker
Users that are interested in EditThinker are comparing it to the libraries listed below
Sorting:
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆27Aug 7, 2025Updated 6 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- ☆27Updated this week
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 8 months ago
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆41Jul 8, 2025Updated 7 months ago
- [NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》☆37Dec 29, 2025Updated 2 months ago
- ☆48Feb 9, 2026Updated 3 weeks ago
- ☆22Aug 17, 2024Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated 11 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Oct 31, 2024Updated last year
- [CVPR 2025] Test-Time Visual In-Context Tuning☆29Dec 31, 2025Updated 2 months ago
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆52Jan 7, 2026Updated last month
- ☆53Dec 10, 2025Updated 2 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 5 months ago
- This is the official repository for "BokehDiff: Neural Lens Blur with One-Step Diffusion" (ICCV'25).☆46Sep 12, 2025Updated 5 months ago
- The impletation of paper titled GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis☆20Nov 23, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Equivariant Image Modeling"(https://arxiv.org/abs/2503.18948)☆35Aug 1, 2025Updated 7 months ago
- Official implementation of the paper "Watermarking Autoregressive Image Generation" (NeurIPS'25)☆58Sep 19, 2025Updated 5 months ago
- [AAAI 2026] Few-step Flow for 3D Generation via Marginal-Data Transport Distillation☆50Jan 9, 2026Updated last month
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"☆68Dec 2, 2025Updated 3 months ago
- Official Repository of "ROSE: Remove Objects with Side Effects in Videos"☆137Oct 15, 2025Updated 4 months ago
- The official SpeakerVid-5M data curation code.☆68Jul 23, 2025Updated 7 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆88Nov 4, 2024Updated last year
- This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video…☆104Apr 9, 2025Updated 10 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- ☆34Dec 29, 2025Updated 2 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- Finetune your VAE on private datasets!☆37Jun 20, 2024Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Apr 9, 2025Updated 10 months ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated last week
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models☆170Jan 4, 2026Updated last month
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆19Feb 5, 2026Updated 3 weeks ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆158Sep 12, 2025Updated 5 months ago
- ☆43Dec 1, 2025Updated 3 months ago
- A script from Mike O'Driscoll to toggle Tailscale exit nodes from a GL.iNet physical switch.☆26Jan 5, 2026Updated last month
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆30Sep 29, 2025Updated 5 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆19Jan 6, 2026Updated last month
- Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence☆259Feb 13, 2026Updated 2 weeks ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 2 months ago