[ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆20May 6, 2025Updated 9 months ago
Alternatives and similar repositories for CLIPDrag
Users who are interested in CLIPDrag are comparing it to the libraries listed below
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 8 months ago
- ☆13Dec 18, 2024Updated last year
- Video Visual Relation Detection (VidVRD) tracklet generation. Also for the ACM MM Visual Relation Understanding Grand Challenge☆39Dec 5, 2022Updated 3 years ago
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles.☆36Mar 11, 2022Updated 3 years ago
- Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection☆43Jun 4, 2024Updated last year
- ☆12Jun 7, 2023Updated 2 years ago
- Study for Python, ML, DL, Data Science, etc.☆15Feb 6, 2026Updated 3 weeks ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆14Sep 25, 2025Updated 5 months ago
- [2022.05.16 ~ 2022.06.10] 🌤️Clear photos without fine dust📷 - Boostcamp AI Tech 3rd cohort final project☆14Jun 11, 2022Updated 3 years ago
- Best Paper Awards in Top Conferences of Artificial Intelligence☆23Oct 11, 2025Updated 4 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- ☆16Nov 7, 2023Updated 2 years ago
- ☆11Jul 26, 2024Updated last year
- ☆14Sep 11, 2025Updated 5 months ago
- Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)☆10Apr 8, 2020Updated 5 years ago
- ☆11Feb 22, 2024Updated 2 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- arXiv Paper Portal in Computer Science of 2025-2026☆20Jan 4, 2026Updated last month
- The official implementation of NeurIPS 2025 D&B paper: IndustryEQA: Pushing the frontiers of Embodied Question Answering in Industrial Sc…☆12Sep 25, 2025Updated 5 months ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆16Oct 4, 2025Updated 4 months ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- Tensorflow implementation of deformable conv and pooling operations.☆10Jul 17, 2017Updated 8 years ago
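- Intelligent driver-assistance application based on machine vision; a mobile offshoot of the system, developed with Flutter and adapted for both platforms (Android, iOS)☆12Jun 16, 2019Updated 6 years ago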
- Optimizing class activation maps by causal inference for the weakly-supervised object localization task☆11May 5, 2022Updated 3 years ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- [CVPR'25 Highlight] Official code for the paper "Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning"☆14Jun 15, 2025Updated 8 months ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆48Jul 11, 2023Updated 2 years ago
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆40Nov 20, 2025Updated 3 months ago
- To explain the process of warping an image using optical flow of the sequence☆12Sep 13, 2021Updated 4 years ago
- Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)☆14May 6, 2025Updated 9 months ago
- Official codebase for AdaRank: Adaptive Rank Pruning for Enhanced Model Merging (ICLR 2026)☆16Jan 26, 2026Updated last month
- [CVPR-2024] NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation☆16Oct 19, 2024Updated last year
- MOFY: MOsaic For You, real-time de-identification of unspecified persons☆13Jun 22, 2022Updated 3 years ago
- K-means algorithm implementation in Javascript.☆20Jan 19, 2021Updated 5 years ago
- Pythonic refactor of the OpenGL Redbook example source (http://www.opengl.org/resources/code/samples/redbook/). I hope you find it usefu…☆12Oct 5, 2011Updated 14 years ago
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆14Jun 14, 2024Updated last year
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆17Nov 4, 2025Updated 3 months ago
- ☆28Jan 15, 2026Updated last month