[IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
☆39Jul 19, 2025Updated 7 months ago
Alternatives and similar repositories for RiVEG
Users that are interested in RiVEG are comparing it to the libraries listed below
Sorting:
- [EMNLP 2023 Findings] Prompting Chatgpt in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge☆33Nov 10, 2024Updated last year
- [AAAI 2025] Code for the paper: "Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition"☆37Apr 15, 2025Updated 10 months ago
- ☆39Nov 28, 2023Updated 2 years ago
- This is code for Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Taggin…☆65Mar 16, 2024Updated last year
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆46Jun 5, 2024Updated last year
- Source code of paper:"Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction".☆20Nov 3, 2023Updated 2 years ago
- ☆23Apr 1, 2024Updated last year
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago
- ☆18Feb 17, 2023Updated 3 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆16Mar 31, 2025Updated 11 months ago
- [CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"☆19Oct 10, 2023Updated 2 years ago
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"☆41Dec 22, 2023Updated 2 years ago
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆121Mar 13, 2025Updated 11 months ago
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆20Dec 5, 2024Updated last year
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆20Mar 3, 2021Updated 5 years ago
- ☆20Jul 28, 2025Updated 7 months ago
- Code for ACL 2023 paper "Rethinking Multimodal Entity and Relation Extraction from a Translation Point of View"☆24Jan 18, 2026Updated last month
- [ACM MM 2021] A causal perspective for compositional action recognition, providing a counterfactual debiasing inference implementation to…☆20May 5, 2022Updated 3 years ago
- ☆22Apr 12, 2022Updated 3 years ago
- CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model☆25Apr 27, 2024Updated last year
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…☆28Feb 23, 2025Updated last year
- 📝 Source code for "ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous Knowledge Resources for Commonsense Validation and E…☆23Jun 17, 2023Updated 2 years ago
- Recent Advances in Visual Dialog☆30Aug 19, 2022Updated 3 years ago
- [CCKS2022 ] Multimodal Event Detection and Argument Extraction.☆31Dec 4, 2022Updated 3 years ago
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆70Nov 23, 2021Updated 4 years ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆34Aug 5, 2025Updated 7 months ago
- Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".☆36Feb 27, 2025Updated last year
- Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance☆68Oct 16, 2024Updated last year
- ☆13Feb 17, 2025Updated last year
- DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction☆41Apr 19, 2023Updated 2 years ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 9 months ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 2 years ago
- A python library / model for creating co-references between AMR graph nodes.☆11Dec 11, 2022Updated 3 years ago
- 2020湖南省第一届人工智能大赛参赛作品☆11Feb 17, 2022Updated 4 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- ☆44Feb 9, 2026Updated last month
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago