1e12Leon / ProbDet
☆20Updated last year
Related projects: ⓘ
- Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images, TGRS 2024.☆20Updated last week
- A User-Friendly Toolkit for UAV Light-weighting Object Detection☆12Updated last year
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆107Updated 9 months ago
- This is the official implementation for our TGRS 2024 paper "Text-Guided Diverse Image Synthesis for Long-Tailed Remote Sensing Object Cl…☆11Updated 2 months ago
- This is the code of the paper "Towards Generalized UAV Object Detection: A Novel Perspective from Frequency Domain Disentanglement",which…☆13Updated 7 months ago
- ☆16Updated 8 months ago
- 🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (R…☆36Updated 5 months ago
- [CVPR23] Visual Prompt Multi-Modal Tracking☆244Updated last year
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆82Updated 5 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆187Updated last month
- ☆48Updated last month
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆83Updated last year
- [CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Ref…☆114Updated last month
- ☆78Updated 5 months ago
- ☆10Updated 5 months ago
- ☆55Updated 10 months ago
- ☆116Updated 6 months ago
- Official LEVIR-CC dataset and Pytorch implementation for Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Meth…☆104Updated 4 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆168Updated last month
- RS5M: a large-scale vision language dataset for remote sensing☆191Updated 3 weeks ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆39Updated 5 months ago
- Open-vocabulary Semantic Segmentation☆296Updated 4 months ago
- Official pytorch implementation of paper "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"☆25Updated last year
- ☆17Updated 4 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆109Updated 9 months ago
- [CVPR 2024] Official implement of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segme…☆216Updated last week
- ☆23Updated last month
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆270Updated 2 months ago
- ☆16Updated this week
- Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey☆117Updated this week