HKUST-LongGroup / CLIPDragLinks
[ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆20Updated 8 months ago
Alternatives and similar repositories for CLIPDrag
Users that are interested in CLIPDrag are comparing it to the libraries listed below
Sorting:
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆213Updated 9 months ago
- This is a repository to collect training-free algorithms for visual generation and manipulation☆205Updated this week
- Official code of SmartEdit [CVPR-2024 Highlight]☆370Updated last year
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆329Updated last month
- 🚀 Cross attention map tools for huggingface/diffusers☆384Updated 2 weeks ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆410Updated 4 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"☆383Updated last year
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆96Updated 2 years ago
- ☆175Updated 7 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆126Updated 7 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆139Updated last month
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆39Updated 2 years ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆319Updated 9 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆42Updated 10 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆273Updated 2 months ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆31Updated last year
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆206Updated 6 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆185Updated 8 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆157Updated 3 weeks ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆83Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆148Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Updated last year
- ☆13Updated last year
- A collection of resources on personalized image generation.☆230Updated last month
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation☆120Updated last month
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆182Updated 2 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 10 months ago
- A collection of vision foundation models unifying understanding and generation.☆59Updated last year
- Official Implementation of VideoDPO☆157Updated 8 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆292Updated 9 months ago