Yutong-Zhou-cv / AgriBench
[ECCV 2024 Workshopπ] The first agriculture benchmark to evaluate MM-LLMs.
β15Updated 3 months ago
Alternatives and similar repositories for AgriBench:
Users that are interested in AgriBench are comparing it to the libraries listed below
- GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasksβ36Updated last week
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Groundingβ46Updated 2 months ago
- β21Updated 8 months ago
- β10Updated 4 months ago
- A Large Multimodal Model for Remote Sensing Change Descriptionβ18Updated 5 months ago
- Accompanying repo for 'Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs' projectβ26Updated 7 months ago
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?β26Updated 4 months ago
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruningβ19Updated last month
- β26Updated 4 months ago
- β40Updated 4 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image β¦β72Updated 2 weeks ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysisβ82Updated last month
- Detectron2 Toolbox and Benchmark for V3Detβ16Updated 10 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"β31Updated 6 months ago
- Foundation models & LLMsβ43Updated 2 weeks ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Desβ¦β55Updated 9 months ago
- β48Updated 11 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Groundingβ56Updated 5 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentationβ49Updated last month
- Code and updates for the ScoreRS project.β18Updated last month
- Paper list for LLM/MLLM-based image segmentationβ14Updated this week
- β10Updated last month
- β12Updated 4 months ago
- This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.β49Updated last year
- The official implementation of "Segment Anything with Multiple Modalities".β92Updated 7 months ago
- β13Updated 4 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentationβ83Updated 3 weeks ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understandingβ92Updated last month
- β19Updated 3 weeks ago
- Segment Anything with Deictic Promptingβ25Updated 5 months ago