zcablii / ViTPLinks
Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"
☆72Updated this week
Alternatives and similar repositories for ViTP
Users that are interested in ViTP are comparing it to the libraries listed below
Sorting:
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆137Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆35Updated 2 months ago
- [TPAMI] Oriented object detection on STAR dataset.☆82Updated 8 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆76Updated 4 months ago
- ☆19Updated last year
- This is the pytorch implement of the paper "RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models"☆63Updated 2 months ago
- Code and updates for the ScoreRS project.☆30Updated last month
- ☆37Updated 4 months ago
- Paper list for LLM/MLLM-based image segmentation☆35Updated this week
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆65Updated 9 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆42Updated 4 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆60Updated 8 months ago
- This is a official code repository of ROS-SAM☆56Updated 6 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆54Updated 4 months ago
- ☆11Updated last year
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆48Updated 2 months ago
- ☆86Updated 8 months ago
- AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation☆114Updated 4 months ago
- [AAAI2025] Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection☆29Updated 3 months ago
- The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretatio…☆53Updated 11 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆50Updated 5 months ago
- [CVPR 2025] Official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark".☆118Updated last month
- [IEEE TGRS 2025] Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation☆26Updated last month
- [TGRS 2024] Co-training Transformer for Remote Sensing Image Classification, Segmentation and Detection.☆44Updated 6 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆68Updated 5 months ago
- [CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.☆51Updated 2 months ago
- ☆36Updated last year
- [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection☆35Updated 3 weeks ago
- This is the implement of the paper "RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models"☆16Updated 2 months ago
- ☆60Updated 5 months ago