zcablii / ViTPLinks
Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"
☆81Updated 2 weeks ago
Alternatives and similar repositories for ViTP
Users that are interested in ViTP are comparing it to the libraries listed below
Sorting:
- This is the implement of the paper "RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models"☆19Updated 3 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆140Updated last year
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆77Updated 5 months ago
- This is the pytorch implement of the paper "RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models"☆65Updated 3 months ago
- ☆87Updated 9 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆38Updated 3 months ago
- [TPAMI] Oriented object detection on STAR dataset.☆83Updated 9 months ago
- Paper list for LLM/MLLM-based image segmentation☆36Updated last week
- This is a official code repository of ROS-SAM☆57Updated 6 months ago
- ☆37Updated 5 months ago
- AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation☆119Updated 4 months ago
- ☆19Updated last year
- Code and updates for the ScoreRS project.☆32Updated last month
- ☆11Updated last year
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆67Updated 10 months ago
- [AAAI2025] Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection☆29Updated 4 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆200Updated 4 months ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆210Updated 2 weeks ago
- SARLANG-1M is a large-scale benchmark tailored for multimodal SAR image understanding, with a primary focus on integrating SAR with textu…☆34Updated 4 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆55Updated this week
- [CVPR 2025] Official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark".☆127Updated 2 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆63Updated 8 months ago
- Implementation of paper "CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis"☆23Updated 2 months ago
- ☆33Updated 10 months ago
- The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretatio…☆53Updated 11 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆146Updated last month
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆55Updated 5 months ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆48Updated last week
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆43Updated 4 months ago
- [CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.☆57Updated 3 months ago