A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for PIR-pytorch
Users that are interested in PIR-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.☆16Aug 10, 2024Updated last year
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆26Dec 19, 2025Updated 3 months ago
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆29Jan 14, 2024Updated 2 years ago
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆66Mar 10, 2025Updated last year
- The first research for semantic localization☆28Dec 6, 2023Updated 2 years ago
- The source code of AMFMN and the dataset RSITMD☆217Oct 25, 2023Updated 2 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 3 years ago
- The code for "Semi-Supervised Cross-Modal Hashing with Multi-view Graph Representation"☆11Apr 18, 2021Updated 4 years ago
- ☆24Sep 19, 2024Updated last year
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆526Jun 27, 2024Updated last year
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 5 years ago
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- ☆14Oct 10, 2022Updated 3 years ago
- Convert A XML_VOC annotations of the BDD100k dataset to YOLO format and training a custom dataset for vehicles with YOLOv5, YOLOv8☆17Apr 13, 2023Updated 2 years ago
- Collection of Remote Sensing Vision-Language Models☆142May 13, 2024Updated last year
- ☆19Dec 19, 2025Updated 3 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆51Jun 10, 2025Updated 9 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆301Mar 17, 2025Updated last year
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024☆17Oct 3, 2024Updated last year
- Source code for ICMR'19 paper "Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval"☆18Mar 22, 2025Updated last year
- Multi-Spectral Remote Sensing Image Retrieval using Geospatial Foundation Models☆51Sep 18, 2025Updated 6 months ago
- Summary of Related Research on Image-Text Matching☆74May 20, 2023Updated 2 years ago
- Code of Learning Cross-view Visual Geo-localization without Ground Truth☆12Feb 17, 2025Updated last year
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆200Dec 10, 2024Updated last year
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆31Feb 11, 2026Updated last month
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆39Mar 27, 2025Updated 11 months ago
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆11Apr 22, 2024Updated last year
- 基于c++ muduo网络库的集群聊天服务器,使用nginx实现负载均衡,使用reids消息队列实现跨服务器通信☆11Feb 23, 2024Updated 2 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆18May 10, 2023Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- ☆12Mar 14, 2024Updated 2 years ago
- ☆16Apr 3, 2023Updated 2 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- ☆19Apr 5, 2024Updated last year
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆62Nov 22, 2023Updated 2 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆50Jul 25, 2023Updated 2 years ago