opendatalab / LOKI
The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
☆124Updated last month
Alternatives and similar repositories for LOKI:
Users who are interested in LOKI are comparing it to the repositories listed below
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆42Updated last month
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆39Updated last month
- The official PyTorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting☆24Updated 9 months ago
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆97Updated 7 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆48Updated 3 weeks ago
- [NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆136Updated this week
- Text4Seg: Reimagining Image Segmentation as Text Generation☆40Updated 2 weeks ago
- The official implementation of "Segment Anything with Multiple Modalities".☆83Updated 4 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆67Updated last week
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆140Updated 3 months ago
- The official implementation of the paper: Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs (ICCV 2023)☆41Updated 7 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆103Updated 6 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆78Updated last week
- Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"☆219Updated 3 weeks ago
- ☆61Updated 2 months ago
- ☆36Updated 2 weeks ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆187Updated 6 months ago
- The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"☆13Updated 4 months ago
- Explore the Limits of Omni-modal Pretraining at Scale☆96Updated 4 months ago
- The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation". (CVPR 2024, Highligh…☆39Updated last month
- The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network". (ECCV 2024)☆59Updated this week
- An official repo for the ECCV 2024 paper "Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching"☆66Updated 3 months ago
- Official implementation of MIA-DPO☆49Updated 2 months ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆19Updated 2 months ago
- ☆19Updated last month
- ☆34Updated last month
- Awesome lists about framework figures in papers☆70Updated last month
- ☆76Updated 8 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding☆123Updated last month
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆147Updated this week