VisionXLab / mllm-mmrotate
[IGARSS 2025] A Simple Aerial Detection Baseline of Multimodal Language Models.
☆58 Updated last week
Alternatives and similar repositories for mllm-mmrotate:
Users who are interested in mllm-mmrotate compare it to the repositories listed below.
- [CVPR 2025] Official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark". ☆39 Updated last week
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding ☆46 Updated 2 months ago
- This is the PyTorch implementation of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware… ☆26 Updated 4 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆86 Updated 2 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery. ☆35 Updated last month
- ☆24 Updated 3 months ago
- Official implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection" ☆82 Updated last month
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding ☆88 Updated 2 weeks ago
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning ☆19 Updated last week
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model ☆77 Updated last month
- [TPAMI] Oriented object detection on STAR dataset. ☆74 Updated last month
- ☆33 Updated 2 months ago
- ☆25 Updated last year
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis ☆78 Updated last month
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation ☆112 Updated 11 months ago
- ☆39 Updated 3 months ago
- ☆35 Updated 8 months ago
- ☆27 Updated 6 months ago
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection" ☆30 Updated 5 months ago
- The project for the ECCV 2024 Oral paper "Oriented Object Detection via Point-Axis Representation" ☆49 Updated 3 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation ☆69 Updated this week
- [ECCV'24] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning" ☆45 Updated last month
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery ☆12 Updated last week
- ☆48 Updated 10 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images" ☆35 Updated 2 weeks ago
- (TGRS 2024) OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images ☆27 Updated last week
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images. ☆35 Updated last month
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022 ☆129 Updated last year
- ☆107 Updated 3 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆44 Updated 2 weeks ago