zheng0116 / ImageRetrievalLinks
This project is an image retrieval system based on DINOv2 and CLIP models. It uses Chroma vector database to support both text-to-image and image-to-image retrieval.
☆26Updated last month
Alternatives and similar repositories for ImageRetrieval
Users that are interested in ImageRetrieval are comparing it to the libraries listed below
Sorting:
- Official implementation and datasets of AddressCLIP☆67Updated last year
- This repository represents the official implementation of the paper titled "Towards Generalizable Scene Change Detection (CVPR 2025)".☆63Updated 2 weeks ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆53Updated last week
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆107Updated 10 months ago
- ☆44Updated last month
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆60Updated 7 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Updated 6 months ago
- Adapting Dense Matching for Homography Estimation with Grid-based Acceleration (CVPR'25)☆22Updated 2 months ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆221Updated 2 months ago
- [IEEE JSTARS 2024] CV-Cities: Advancing Cross-view Geo-localization in Global Cities☆57Updated 3 months ago
- [AAAI2026] X-SAM: From Segment Anything to Any Segmentation☆333Updated 3 weeks ago
- The official implementation of "Segment Anything with Multiple Modalities".☆108Updated last year
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆111Updated last year
- RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension☆27Updated last week
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆42Updated 5 months ago
- Attention-guided Feature Distillation for Semantic Segmentation☆41Updated last month
- ☆64Updated last month
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular…☆155Updated last month
- [TPAMI2025&CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.☆187Updated last year
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆13Updated 9 months ago
- SmartCLIP: A training method to improve CLIP with both short and long texts☆30Updated 6 months ago
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆45Updated 2 months ago
- The official code of our CVPR2025 paper: "Segment Any-Quality Images with Generative Latent Space Enhancement".☆30Updated 2 months ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆387Updated 5 months ago
- [ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.☆97Updated 5 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆98Updated 3 weeks ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj…☆73Updated 4 months ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆65Updated last month
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆35Updated 8 months ago
- ☆140Updated 4 months ago