[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring Expression
☆19Jun 27, 2025Updated 10 months ago
Alternatives and similar repositories for refedit
Users that are interested in refedit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆13Mar 7, 2026Updated last month
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated 10 months ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆46Jun 27, 2025Updated 10 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 6 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆13Dec 10, 2022Updated 3 years ago
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆28Aug 7, 2025Updated 8 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 6 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- ✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025.☆34Jun 3, 2025Updated 10 months ago
- ☆17May 26, 2023Updated 2 years ago
- NLP tool for wide-range model reliability evaluations☆12Jun 18, 2023Updated 2 years ago
- ☆11Apr 4, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- ☆29Feb 27, 2025Updated last year
- The official repo for the DanQing dataset.☆35Mar 25, 2026Updated last month
- EN1060 lectures☆11Jan 25, 2026Updated 3 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆52Dec 30, 2025Updated 4 months ago
- ☆120Jan 27, 2025Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆12Sep 21, 2023Updated 2 years ago
- code for "PGMAN: An Unsupervised Generative Multi-adversarial Network for Pan-sharpening"☆19Mar 8, 2022Updated 4 years ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆47Aug 26, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2023] Official PyTorch implementation of "A Multidimensional Analysis of Social Biases in Vision Transformers"☆13Aug 11, 2023Updated 2 years ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Jun 26, 2025Updated 10 months ago
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆44Oct 19, 2025Updated 6 months ago
- WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models (CVPR 2024)☆26Jun 14, 2024Updated last year
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆89Jun 6, 2025Updated 10 months ago
- ☆26Jun 20, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆105Jul 5, 2024Updated last year
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆107Feb 25, 2026Updated 2 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning☆52Mar 26, 2026Updated last month
- Rethinking the Trust Region in LLM Reinforcement Learning☆52Mar 2, 2026Updated 2 months ago
- ☆28Apr 25, 2025Updated last year
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 9 months ago
- ☆34Nov 18, 2025Updated 5 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year