Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.
☆56Mar 31, 2026Updated last month
Alternatives and similar repositories for UniREditBench
Users that are interested in UniREditBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆129Jan 29, 2026Updated 3 months ago
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆25Sep 27, 2024Updated last year
- ☆23Oct 28, 2024Updated last year
- EraseAnything, ICML 2025☆40Sep 28, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆75Apr 15, 2025Updated last year
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated 2 years ago
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆33Aug 5, 2025Updated 9 months ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆60Nov 27, 2025Updated 5 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆227Apr 14, 2026Updated last month
- [ICLR'26] SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models☆39Mar 9, 2026Updated 2 months ago
- (IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated last year
- ThinkGen: Generalized Thinking for Visual Generation☆53Dec 30, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR'25] Reconstructive Visual Instruction Tuning☆134Apr 9, 2025Updated last year
- Code Implementation of “RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers”☆32Apr 13, 2026Updated last month
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated last year
- ☆11Nov 5, 2024Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago
- ☆23Mar 17, 2026Updated 2 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆13Jan 22, 2025Updated last year
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆97Nov 21, 2025Updated 5 months ago
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation☆134Apr 3, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆40Nov 24, 2025Updated 5 months ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- Knowledge Guided Multi-instance Multi-label Networks (KG-MIML-Net) for Medicines Prediction☆13Oct 2, 2018Updated 7 years ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆32Aug 21, 2025Updated 8 months ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆109Jul 18, 2025Updated 10 months ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆37Apr 2, 2026Updated last month
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? [ICLR26]☆39Jun 23, 2025Updated 10 months ago
- [CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner☆184Feb 22, 2026Updated 2 months ago
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning☆54Mar 26, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reimplementation of D4RT☆43Dec 26, 2025Updated 4 months ago
- a tools to process human 3D model, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆106Apr 23, 2025Updated last year
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆63Mar 23, 2026Updated last month
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆624Jan 5, 2026Updated 4 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Nov 11, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆460Aug 8, 2025Updated 9 months ago