zheng0116 / ImageRetrieval
This project is an image retrieval system based on DINOv2 and CLIP models. It uses Chroma vector database to support both text-to-image and image-to-image retrieval.
☆27 Updated last month
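The retrieval idea in the description can be illustrated with a minimal sketch. It is not the project's code: the real system would produce embeddings with DINOv2 or CLIP and store them in Chroma, whereas this sketch uses hand-written three-dimensional vectors and a hypothetical in-memory `ToyVectorIndex` so that it runs with the Python standard library alone. The file names and the vector values are made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


class ToyVectorIndex:
    """Hypothetical stand-in for a vector database such as Chroma."""

    def __init__(self):
        self._items = []  # list of (item_id, embedding) pairs

    def add(self, item_id, embedding):
        self._items.append((item_id, list(embedding)))

    def query(self, embedding, top_k=3):
        # Score every stored vector against the query, best match first.
        scored = [
            (item_id, cosine_similarity(embedding, vec))
            for item_id, vec in self._items
        ]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


# Made-up "image embeddings"; a real system would compute these with a model.
index = ToyVectorIndex()
index.add("cat.jpg", [0.9, 0.1, 0.0])
index.add("dog.jpg", [0.1, 0.9, 0.0])
index.add("car.jpg", [0.0, 0.1, 0.9])

# Made-up "text embedding" for a query such as "a cat". Text-to-image search
# only works when text and images share one embedding space, as with CLIP.
text_query = [0.8, 0.2, 0.0]
results = index.query(text_query, top_k=2)
for item_id, score in results:
    print(item_id, round(score, 3))
```

Image-to-image retrieval follows the same pattern, with the query vector computed from an image instead of a text prompt.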
Alternatives and similar repositories for ImageRetrieval
Users who are interested in ImageRetrieval are comparing it to the repositories listed below.
- This repository represents the official implementation of the paper titled "Towards Generalizable Scene Change Detection (CVPR 2025)". ☆66 Updated 2 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions. ☆60 Updated 2 months ago
- Official implementation and datasets of AddressCLIP ☆66 Updated last year
- The official implementation of "Segment Anything with Multiple Modalities". ☆111 Updated last year
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆106 Updated 2 months ago
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular… ☆170 Updated 3 months ago
- An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching ☆109 Updated last year
- Official PyTorch implementation of WPS from our paper: WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models ☆13 Updated 7 months ago
- ☆57 Updated 3 months ago
- Code for <Zero-Shot Scene Change Detection> in AAAI 2025 ☆51 Updated 7 months ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj… ☆75 Updated 6 months ago
- [TPAMI2025&CVPR2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. ☆188 Updated last year
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model ☆68 Updated last week
- Official implementation of "Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation". ☆32 Updated 2 months ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt" ☆17 Updated 2 months ago
- SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images (ECCV 2024) ☆25 Updated 8 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm” ☆82 Updated 3 months ago
- [AAAI2026] X-SAM: From Segment Anything to Any Segmentation ☆354 Updated last week
- [ECCV 2024] The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network". ☆101 Updated 7 months ago
- [ICRA-2025] Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention Mechanisms ☆42 Updated 2 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆149 Updated last month
- [AAAI 2024] VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning ☆14 Updated last year
- ☆10 Updated last year
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation… ☆48 Updated 4 months ago
- Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer ☆13 Updated last year
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral) ☆26 Updated 7 months ago
- This repo contains a curated list of scene change detection (SCD) resources, including papers, videos, code, and related websites. ☆120 Updated last month
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing" ☆65 Updated 9 months ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation ☆58 Updated 6 months ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023. ☆21 Updated 2 years ago