Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.
☆54Jan 7, 2026Updated 2 months ago
Alternatives and similar repositories for UniREditBench
Users that are interested in UniREditBench are comparing it to the libraries listed below
Sorting:
- [ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation☆16Feb 2, 2023Updated 3 years ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆128Jan 29, 2026Updated last month
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- ☆24Oct 28, 2024Updated last year
- [CVPR 2024] Official implementation of CVPR 2024 paper: "Doubly Abductive Counterfactual Inference for Text-based Image Editing"☆25Mar 8, 2024Updated 2 years ago
- EraseAnything, ICML 2025☆39Sep 28, 2025Updated 5 months ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆74Apr 15, 2025Updated 11 months ago
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated last year
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 3 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆214Mar 11, 2026Updated last week
- (IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated last year
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆135Apr 9, 2025Updated 11 months ago
- Code Implementation of “RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers”☆30Dec 27, 2025Updated 2 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated 11 months ago
- Reimplementation of D4RT☆38Dec 26, 2025Updated 2 months ago
- ☆11Nov 5, 2024Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 2 months ago
- 哈尔滨工业大学2023春季学期编译系统课程实验、习题、课件以及期末复习材料☆11Jul 30, 2023Updated 2 years ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆13Jan 22, 2025Updated last year
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation☆125Mar 2, 2026Updated 2 weeks ago
- ☆20Updated this week
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆96Nov 21, 2025Updated 3 months ago
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆37Nov 24, 2025Updated 3 months ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆104Jul 18, 2025Updated 8 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 6 months ago
- [CVPR 2026] VideoCoF: Unified Video Editing with Temporal Reasoner☆158Feb 22, 2026Updated 3 weeks ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆33Dec 27, 2025Updated 2 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 8 months ago
- a tools to process human 3D model, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆105Apr 23, 2025Updated 10 months ago
- 依照The Annotated Transformer 的指导实现Transformer, 并加入进去详细的描述,适合小白☆11Feb 2, 2020Updated 6 years ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆591Jan 5, 2026Updated 2 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Nov 11, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆449Aug 8, 2025Updated 7 months ago
- ☆37Nov 13, 2025Updated 4 months ago
- [ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?"☆19Aug 7, 2025Updated 7 months ago
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆87Jan 26, 2026Updated last month