opendatalab / LOKI
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
☆130Updated last week
Alternatives and similar repositories for LOKI:
Users that are interested in LOKI are comparing it to the libraries listed below
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆42Updated 2 months ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆39Updated 2 months ago
- The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [Arxiv]☆23Updated 10 months ago
- [ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.☆62Updated last week
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆97Updated 8 months ago
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆39Updated 2 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆143Updated last month
- The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"☆14Updated 5 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆22Updated 4 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆53Updated 3 weeks ago
- The official implementation of the paper: Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs (ICCV 2023)☆45Updated 8 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆72Updated 3 weeks ago
- The official implementation of "Segment Anything with Multiple Modalities".☆84Updated 5 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆47Updated last month
- [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models☆84Updated 6 months ago
- ☆65Updated 3 months ago
- The source code for "UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All"☆39Updated 10 months ago
- The models, datasets(satellite&street view) and correlative config files of OmniCity-v1.0 project.☆27Updated last year
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆74Updated this week
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆152Updated this week
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Mode☆39Updated 2 months ago
- Official implementation and datasets of AddressCLIP☆48Updated 7 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆106Updated 8 months ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆197Updated 7 months ago
- ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆92Updated last week
- Precision Search through Multi-Style Inputs☆63Updated 6 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆82Updated last month
- Official implementation of the Law of Vision Representation in MLLMs☆149Updated 3 months ago