HKUST-LongGroup / CLIPDrag
[ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆16 Updated 3 months ago
Alternatives and similar repositories for CLIPDrag
Users who are interested in CLIPDrag are comparing it to the repositories listed below.
- This is a repository to collect training-free algorithms for visual generation and manipulation ☆107 Updated last week
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆286 Updated 4 months ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems ☆355 Updated last month
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing ☆78 Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆164 Updated 7 months ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea" ☆179 Updated 5 months ago
- [ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code" ☆339 Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆144 Updated 2 weeks ago
- 🚀 Cross attention map tools for huggingface/diffusers ☆335 Updated 7 months ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆92 Updated last year
- ☆27 Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction" ☆245 Updated 4 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆95 Updated 6 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆174 Updated 11 months ago
- Official code of SmartEdit [CVPR-2024 Highlight] ☆352 Updated last year
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following ☆30 Updated 7 months ago
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization ☆16 Updated 3 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis ☆73 Updated 7 months ago
- An unofficial implementation of DiffEdit on stable-diffusion ☆78 Updated 2 years ago
- Official Implementation of VideoDPO ☆137 Updated 3 months ago
- A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space ☆85 Updated last year
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆24 Updated 8 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆122 Updated 2 months ago
- Benchmark for generative image models ☆97 Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆117 Updated 10 months ago
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func… ☆27 Updated 9 months ago
- A collection of awesome autoregressive visual generation models ☆77 Updated 4 months ago
- ☆25 Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆123 Updated 9 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆36 Updated 5 months ago