A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for PIR-pytorch
Users that are interested in PIR-pytorch are comparing it to the libraries listed below.
- Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.☆16Aug 10, 2024Updated last year
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆27Dec 19, 2025Updated 3 months ago
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆29Jan 14, 2024Updated 2 years ago
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆66Mar 10, 2025Updated last year
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆72Oct 25, 2023Updated 2 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 3 years ago
- The code for "Semi-Supervised Cross-Modal Hashing with Multi-view Graph Representation"☆11Apr 18, 2021Updated 4 years ago
- ☆23Sep 19, 2024Updated last year
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆535Jun 27, 2024Updated last year
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 5 years ago
- ☆14Oct 10, 2022Updated 3 years ago
- Collection of Remote Sensing Vision-Language Models☆142May 13, 2024Updated last year
- Label Embedding Online Hashing for Cross-Modal Retrieval☆13Sep 22, 2025Updated 6 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆304Mar 17, 2025Updated last year
- Source code of our TOMM 2019 paper "CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning".☆19Apr 18, 2019Updated 6 years ago
- Source code for ICMR'19 paper "Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval"☆18Mar 22, 2025Updated last year
- Code of Learning Cross-view Visual Geo-localization without Ground Truth☆11Feb 17, 2025Updated last year
- PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…☆41Nov 1, 2022Updated 3 years ago
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆32Feb 11, 2026Updated 2 months ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)☆230Nov 28, 2021Updated 4 years ago
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆11Apr 22, 2024Updated last year
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆27Jul 21, 2025Updated 8 months ago
- ☆28Feb 7, 2024Updated 2 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆18May 10, 2023Updated 2 years ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated last year
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- ☆12Mar 14, 2024Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆73Jan 2, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated 2 months ago
- ☆27Jun 20, 2021Updated 4 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- ☆19Apr 5, 2024Updated 2 years ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆62Nov 22, 2023Updated 2 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆51Jul 25, 2023Updated 2 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year