Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20Nov 4, 2024Updated last year
Alternatives and similar repositories for KEDs
Users that are interested in KEDs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions☆15Dec 27, 2023Updated 2 years ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Oct 17, 2023Updated 2 years ago
- DMAOT ranked 1st in the VOTS 2023 challenge.☆17Dec 21, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆85Jul 4, 2024Updated last year
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆145Jan 5, 2026Updated 3 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆196Jul 31, 2025Updated 8 months ago
- Data pre-processing and training code on Open-X-Embodiment with pytorch☆11Jan 20, 2025Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 8 months ago
- ☆11Jun 11, 2025Updated 10 months ago
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆47Nov 16, 2024Updated last year
- The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.☆20Feb 22, 2023Updated 3 years ago
- [CVPR2022] Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis☆100Jun 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- This is the official implementation of "Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds" (Accepted at AAAI 2024).☆11May 4, 2024Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆90Jul 13, 2024Updated last year
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆21May 23, 2023Updated 2 years ago
- Pytorch implementation of Detective☆12Jul 11, 2024Updated last year
- The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"☆15May 4, 2024Updated last year
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation☆59Jun 21, 2023Updated 2 years ago
- ☆13Feb 12, 2023Updated 3 years ago
- [WWW 2025 Oral] ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning☆20Jul 2, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…☆14Aug 24, 2024Updated last year
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year
- ☆22Sep 1, 2025Updated 7 months ago
- This repo consists of my implementation of DocFormerV2☆11Mar 31, 2024Updated 2 years ago
- [ICLR 2025] Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation☆20Mar 21, 2025Updated last year
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- ☆195Mar 5, 2025Updated last year
- List of resources for video retrieval.☆20Mar 17, 2022Updated 4 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆55Nov 26, 2024Updated last year
- vue+elementUI 创建的一个好看的UI页面。暂时无js代码,只作为UI展示。☆11Feb 4, 2023Updated 3 years ago
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆44Mar 25, 2024Updated 2 years ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- Open-source strong baseline for domain generlization re-ID. We will udpate the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago
- Implementation (R2R part) for the paper "Iterative Vision-and-Language Navigation"☆18Apr 4, 2024Updated 2 years ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago