sunzc-sunny / refdroneLinks
RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension
☆30Updated 3 weeks ago
Alternatives and similar repositories for refdrone
Users that are interested in refdrone are comparing it to the libraries listed below
Sorting:
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Updated 2 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆109Updated 11 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Updated 6 months ago
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Updated 3 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Updated 7 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆59Updated 2 months ago
- [TPAMI2025&CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.☆188Updated last year
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆50Updated 7 months ago
- The Project of ECCV 2024 Oral Paper "Oriented Object Detection vis Point-Axis Representation"☆70Updated last year
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy☆41Updated 2 months ago
- Make Large Multimodal Models excel in object detection, ICCV 2025☆61Updated 5 months ago
- [ICCV2023] CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection☆18Updated 8 months ago
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)☆22Updated last month
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug…☆19Updated 8 months ago
- ☆27Updated last month
- [TPAMI 2025] Towards Visual Grounding: A Survey☆280Updated 2 months ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆214Updated 9 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆90Updated 3 weeks ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆223Updated 3 months ago
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆169Updated last month
- [ECCV'24/IJCV'26] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"☆68Updated 2 weeks ago
- ☆31Updated last year
- This code is provided for reproducibility of results in the paper: Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve A…☆20Updated 11 months ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11Updated last year
- ☆26Updated last year
- A vision-language tracking paper list, articles related to visual language tracking have been documented.☆38Updated last year
- ☆12Updated 7 months ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆56Updated 2 months ago
- This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"☆87Updated 7 months ago
- NTIRE 2025 Challenge on 1-st Cross-Domain Few-Shot Object Detection @ CVPR 2025☆68Updated 9 months ago