opendatalab / LOKI
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
☆135Updated last week
Alternatives and similar repositories for LOKI:
Users who are interested in LOKI are comparing it to the repositories listed below
- The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆20Updated this week
- FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis☆25Updated this week
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆46Updated last week
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆28Updated this week
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆39Updated 3 months ago
- The official PyTorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [arXiv]☆23Updated last year
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆70Updated last week
- [NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆153Updated 2 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆50Updated 3 months ago
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆100Updated 9 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆25Updated 5 months ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆81Updated 3 weeks ago
- Official repository for VisionZip (CVPR 2025)☆259Updated last month
- The official implementation of "Segment Anything with Multiple Modalities".☆89Updated 6 months ago
- [ECCV 2024] The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network".☆69Updated 2 weeks ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆31Updated this week
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆210Updated 8 months ago
- Official implementation of MIA-DPO☆54Updated 2 months ago
- ☆56Updated last week
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆140Updated 3 weeks ago
- Official implementation of Unified Reward Model for Multimodal Understanding and Generation.☆225Updated this week
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆47Updated 3 months ago
- ☆107Updated last month
- ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆99Updated 3 weeks ago
- The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate".☆96Updated 4 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆152Updated 5 months ago
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆169Updated 6 months ago
- The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"☆14Updated 6 months ago
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆38Updated 3 months ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".☆53Updated 11 months ago