A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for PIR-pytorch
Users that are interested in PIR-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.☆16Aug 10, 2024Updated last year
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆66Mar 10, 2025Updated last year
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆72Oct 25, 2023Updated 2 years ago
- [ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”☆50Jan 19, 2024Updated 2 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆555Jun 27, 2024Updated last year
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 5 years ago
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)基于YOLOv5的旋转目标检测☆10Mar 27, 2021Updated 5 years ago
- ☆14Oct 10, 2022Updated 3 years ago
- ☆21Dec 19, 2025Updated 5 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆55Jun 10, 2025Updated last year
- Label Embedding Online Hashing for Cross-Modal Retrieval☆13Sep 22, 2025Updated 8 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆312Mar 17, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code of our TOMM 2019 paper "CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning".☆19Apr 18, 2019Updated 7 years ago
- Source code for ICMR'19 paper "Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval"☆18Mar 22, 2025Updated last year
- Summary of Related Research on Image-Text Matching☆74May 20, 2023Updated 3 years ago
- Code of Learning Cross-view Visual Geo-localization without Ground Truth☆11Feb 17, 2025Updated last year
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆207Dec 10, 2024Updated last year
- PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…☆41Nov 1, 2022Updated 3 years ago
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆33Feb 11, 2026Updated 4 months ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)☆234Nov 28, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆12Apr 22, 2024Updated 2 years ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆28Jul 21, 2025Updated 10 months ago
- ☆28Feb 7, 2024Updated 2 years ago
- 基于c++ muduo网络库的集群聊天服务器,使用nginx实现负载均衡,使用reids消息队列实现跨服务器通信☆12Feb 23, 2024Updated 2 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- ☆12Mar 14, 2024Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆74Apr 10, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 4 months ago
- ☆27Jun 20, 2021Updated 4 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆64Nov 22, 2023Updated 2 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated 2 years ago
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆51Jul 25, 2023Updated 2 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year