VisionXLab / mllm-mmrotate
[IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.
☆68 Updated 3 weeks ago
Alternatives and similar repositories for mllm-mmrotate:
Users who are interested in mllm-mmrotate are comparing it to the repositories listed below.
- [CVPR 2025] Official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark". ☆56 Updated last month
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding ☆49 Updated 3 months ago
- [TPAMI] Oriented object detection on STAR dataset. ☆77 Updated 3 months ago
- Implementation of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding" ☆56 Updated last month
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding ☆96 Updated 2 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery. ☆51 Updated 2 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation ☆117 Updated last year
- ☆28 Updated 5 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images" ☆40 Updated last month
- [TGRS 2024] OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images ☆34 Updated last week
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection" ☆32 Updated last month
- PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection ☆24 Updated 2 months ago
- Official implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection" ☆105 Updated 3 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis ☆84 Updated 2 months ago
- [AAAI2025] Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection ☆25 Updated 3 weeks ago
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning ☆22 Updated 2 weeks ago
- [ECCV'24] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning" ☆46 Updated 3 months ago
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022 ☆133 Updated last year
- [CVPR'24] PointOBB: Learning Oriented Object Detection via Single Point Supervision ☆68 Updated 3 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model ☆81 Updated 2 months ago
- ☆35 Updated 10 months ago
- ☆30 Updated 7 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images. ☆37 Updated last month
- ☆41 Updated 5 months ago
- ☆36 Updated 4 months ago
- PyTorch implementation of the paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware… ☆26 Updated 5 months ago
- ☆25 Updated last year
- 🦕 [AAAI'25] Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community" ☆134 Updated this week
- ☆58 Updated last year
- ☆48 Updated last year