sunzc-sunny / refdroneLinks
RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension
☆20Updated 3 weeks ago
Alternatives and similar repositories for refdrone
Users that are interested in refdrone are comparing it to the libraries listed below
Sorting:
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆94Updated 4 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆38Updated 3 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆101Updated 8 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆24Updated 4 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆42Updated 4 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆55Updated 2 months ago
- [CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.☆180Updated last year
- (ECCV 2024) Official repository of paper "Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection"☆21Updated 6 months ago
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy☆32Updated this week
- Make Large Multimodal Models excel in object detection, ICCV 2025☆47Updated 2 months ago
- The Project of ECCV 2024 Oral Paper "Oriented Object Detection vis Point-Axis Representation"☆64Updated 10 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆241Updated 2 months ago
- ☆21Updated last year
- ☆31Updated last year
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆155Updated 2 weeks ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆198Updated 6 months ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆201Updated 3 weeks ago
- ☆24Updated last month
- [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.☆85Updated 3 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆68Updated 5 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆147Updated last month
- [CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective☆20Updated last year
- [ECCV'24] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"☆57Updated 4 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆35Updated 2 months ago
- [ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection☆138Updated 7 months ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆47Updated 3 weeks ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆62Updated this week
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆102Updated last week
- [AAAI 2025] The official repository of our paper "GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Align…☆14Updated last month
- [CVPR2025] Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection☆20Updated 9 months ago