HKUST-LongGroup / CLIPDragLinks
[ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆20Updated 9 months ago
Alternatives and similar repositories for CLIPDrag
Users that are interested in CLIPDrag are comparing it to the libraries listed below
Sorting:
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆214Updated 10 months ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆83Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆330Updated last month
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 10 months ago
- This is a repository to collect training-free algorithms for visual generation and manipulation☆209Updated last week
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆140Updated last week
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆96Updated 2 years ago
- Unified layout planning and image generation, ICCV2025☆40Updated 3 weeks ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆126Updated 7 months ago
- An unofficial implement of DiffEdit on stable-diffusion☆82Updated 3 years ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆31Updated last year
- Official code of SmartEdit [CVPR-2024 Highlight]☆370Updated last year
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆188Updated last year
- ☆119Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆149Updated last year
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆39Updated 2 years ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Updated 5 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186Updated 8 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Updated 5 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Updated last year
- my attempt at implementing the DiffEdit paper (WIP)☆16Updated 3 years ago
- LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer☆49Updated last month
- 🚀 Cross attention map tools for huggingface/diffusers☆388Updated last week
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆413Updated 4 months ago
- Official Implementation of VideoDPO☆160Updated 8 months ago
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆104Updated 3 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"☆386Updated last year
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Updated last year
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆74Updated last year
- [CVPR25] IAR☆17Updated 7 months ago