VisionXLab / mllm-mmrotate
[IGARSS 2025] A Simple Aerial Detection Baseline of Multimodal Language Models.
☆63Updated last week
Alternatives and similar repositories for mllm-mmrotate:
Users that are interested in mllm-mmrotate are comparing it to the libraries listed below
- This is the pytorch implement of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware…☆26Updated 4 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆92Updated last month
- [CVPR 2025] Official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark".☆49Updated last week
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆46Updated 2 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆48Updated this week
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆82Updated last month
- [TPAMI] Oriented object detection on STAR dataset.☆76Updated 2 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆38Updated 2 weeks ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆44Updated last month
- ☆26Updated 4 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆80Updated last month
- ☆40Updated 4 months ago
- (TGRS 2024) OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images☆30Updated 3 weeks ago
- ☆34Updated 3 months ago
- [CVPR'24] PointOBB: Learning Oriented Object Detection via Single Point Supervision☆67Updated 2 months ago
- ☆25Updated last year
- ☆13Updated 4 months ago
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆19Updated last month
- Offical implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"☆92Updated 2 months ago
- 🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆119Updated 2 weeks ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆37Updated this week
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆72Updated 2 weeks ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆114Updated last year
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection"☆32Updated 2 weeks ago
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆133Updated last year
- [ECCV'24] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"☆45Updated 2 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆98Updated last week
- Code and updates for the ScoreRS project.☆18Updated last month
- ☆35Updated 9 months ago
- Collection of Remote Sensing Vision-Language Models☆134Updated 11 months ago