Delicate2000 / MMpediaLinks
☆16Updated 2 years ago
Alternatives and similar repositories for MMpedia
Users that are interested in MMpedia are comparing it to the libraries listed below
Sorting:
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆45Updated last year
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Updated last year
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Updated last year
- ☆37Updated 2 years ago
- Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.☆89Updated 3 years ago
- [IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition☆36Updated 5 months ago
- ☆16Updated 2 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11Updated last year
- Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".☆35Updated 10 months ago
- [KDD 2022] Multi-modal Siamese Network for Entity Alignment☆31Updated 5 months ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆99Updated 2 years ago
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆120Updated 9 months ago
- ☆19Updated 2 years ago
- [ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs☆132Updated last year
- ☆40Updated 3 years ago
- ☆147Updated 3 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆69Updated 3 years ago
- Recent Advances in Visual Dialog☆30Updated 3 years ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Updated 2 years ago
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆53Updated last year
- ☆42Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- ☆31Updated last year
- ☆68Updated 2 years ago
- [ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models☆61Updated last year
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25Updated last year
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…☆52Updated 2 years ago
- Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"☆27Updated 2 years ago
- [KDD 2023] Multi-Grained Multimodal Interaction Network for Entity Linking☆27Updated 2 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Updated last year