ZhenyuLU-Heliodore / CoPRSLinks
Project Page for CoPRS, offering training overview, inference code, and downloadable links.
☆20Updated 3 months ago
Alternatives and similar repositories for CoPRS
Users that are interested in CoPRS are comparing it to the libraries listed below
Sorting:
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆66Updated 11 months ago
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆83Updated 9 months ago
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆26Updated last month
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆49Updated 11 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆115Updated 2 months ago
- vHeat: Building Vision Models upon Heat Conduction☆271Updated 8 months ago
- The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)☆68Updated 8 months ago
- 【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification☆66Updated 11 months ago
- ☆24Updated last year
- [ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”☆46Updated 2 years ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Updated last month
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆65Updated 9 months ago
- ☆77Updated last year
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆150Updated last year
- A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).☆36Updated 2 weeks ago
- [ICLR 2026] A novel cross-modal decoupling and alignment framework for multimodal representation learning.☆44Updated last week
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆20Updated 7 months ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆52Updated 7 months ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆391Updated 7 months ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆25Updated last year
- This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"☆35Updated last year
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.☆17Updated 5 months ago
- Source code of the paper Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification☆37Updated last year
- [AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segm…☆40Updated last year
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆18Updated 3 weeks ago
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆36Updated 6 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…☆190Updated 3 weeks ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆50Updated 8 months ago
- [CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation☆31Updated 7 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆211Updated last year