[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring Expression
☆18 · Jun 27, 2025 · Updated 8 months ago
Alternatives and similar repositories for refedit
Users that are interested in refedit are comparing it to the libraries listed below
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆45 · Jun 11, 2025 · Updated 8 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning" ☆20 · Nov 1, 2025 · Updated 4 months ago
- ☆11 · Nov 30, 2025 · Updated 3 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025) ☆30 · Oct 28, 2025 · Updated 4 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 · Jul 11, 2024 · Updated last year
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation ☆44 · Jun 27, 2025 · Updated 8 months ago
- ☆16 · May 26, 2023 · Updated 2 years ago
- ✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025. ☆32 · Jun 3, 2025 · Updated 9 months ago
- ☆27 · Feb 27, 2025 · Updated last year
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o… ☆27 · Aug 7, 2025 · Updated 6 months ago
- The official implementation of RS-STE proposed by our paper Recognition-Synergistic Scene Text Editing (CVPR 2025). ☆29 · Jul 15, 2025 · Updated 7 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing ☆33 · Dec 9, 2025 · Updated 2 months ago
- ☆26 · Jun 20, 2024 · Updated last year
- ☆27 · Apr 25, 2025 · Updated 10 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation ☆46 · Aug 26, 2025 · Updated 6 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆33 · Aug 18, 2025 · Updated 6 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models" ☆135 · Jan 29, 2026 · Updated last month
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models" ☆40 · Oct 19, 2025 · Updated 4 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc… ☆12 · Oct 14, 2024 · Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆164 · Jun 26, 2025 · Updated 8 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆88 · Jun 6, 2025 · Updated 8 months ago
- Open-vocabulary Semantic Segmentation ☆33 · Feb 16, 2024 · Updated 2 years ago
- EN1060 lectures ☆11 · Jan 25, 2026 · Updated last month
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking ☆13 · Apr 12, 2023 · Updated 2 years ago
- The repository of VG-Refiner paper ☆17 · Dec 9, 2025 · Updated 2 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation ☆37 · Oct 18, 2023 · Updated 2 years ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆92 · Jun 24, 2024 · Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆18 · Jul 10, 2025 · Updated 7 months ago
- ☆10 · Apr 7, 2025 · Updated 10 months ago
- This repository extends the mask editor in ComfyUI and supports a lasso method for applying masks ☆14 · Jul 23, 2025 · Updated 7 months ago
- ComfyUI custom node to automate batch generation with randomized prompts from text files. It mimics Forge's functionality, allowing you to… ☆13 · Aug 23, 2025 · Updated 6 months ago
- [ICCV 2025] "Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning". ☆16 · Dec 11, 2025 · Updated 2 months ago
- AI-powered LinkedIn job application bot with Playwright automation, LLM integration, data anonymization, and Telegram reporting. Applies … ☆37 · Feb 3, 2026 · Updated last month
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- ☆39 · Jul 19, 2024 · Updated last year
- ☆12 · May 26, 2022 · Updated 3 years ago
- ☆11 · Jan 18, 2025 · Updated last year
- ☆42 · Nov 8, 2024 · Updated last year
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai… ☆24 · Jun 4, 2025 · Updated 8 months ago