JinYuanLi0012 / RiVEGView external linksLinks
[IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
☆37Jul 19, 2025Updated 6 months ago
Alternatives and similar repositories for RiVEG
Users that are interested in RiVEG are comparing it to the libraries listed below
Sorting:
- [AAAI 2025] Code for the paper: "Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition"☆37Apr 15, 2025Updated 10 months ago
- ☆39Nov 28, 2023Updated 2 years ago
- This is code for Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Taggin…☆65Mar 16, 2024Updated last year
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆46Jun 5, 2024Updated last year
- Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))☆12Aug 27, 2024Updated last year
- Source code of paper:"Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction".☆20Nov 3, 2023Updated 2 years ago
- The source of MNER-MI.☆19Dec 17, 2024Updated last year
- ☆23Apr 1, 2024Updated last year
- awesome-multimodal-named-entity-recognition☆61Oct 30, 2023Updated 2 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago
- Teeth Mold Point Cloud Completion Via Data Augmentation and Hybrid RL-GAN (Paper Code)☆13May 23, 2023Updated 2 years ago
- ☆18Feb 17, 2023Updated 3 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆16Mar 31, 2025Updated 10 months ago
- Preprocessed Datasets for our Multimodal NER paper☆123Dec 17, 2022Updated 3 years ago
- [CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"☆19Oct 10, 2023Updated 2 years ago
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"☆40Dec 22, 2023Updated 2 years ago
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆121Mar 13, 2025Updated 11 months ago
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆20Mar 3, 2021Updated 4 years ago
- ☆20Jul 28, 2025Updated 6 months ago
- Code for ACL 2023 paper "Rethinking Multimodal Entity and Relation Extraction from a Translation Point of View"☆24Jan 18, 2026Updated 3 weeks ago
- [ACM MM 2021] A causal perspective for compositional action recognition, providing a counterfactual debiasing inference implementation to…☆20May 5, 2022Updated 3 years ago
- ☆22Apr 12, 2022Updated 3 years ago
- Recent Advances in Visual Dialog☆30Aug 19, 2022Updated 3 years ago
- Code and data for ACM MM '23 paper “MORE: A Multimodal Object-Entity Relation Extraction Dataset with a Benchmark Evaluation”☆27Aug 20, 2024Updated last year
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆34Aug 5, 2025Updated 6 months ago
- Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".☆36Feb 27, 2025Updated 11 months ago
- Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance☆68Oct 16, 2024Updated last year
- A reading list of papers about Visual Grounding.☆32Aug 24, 2022Updated 3 years ago
- ☆39Feb 9, 2026Updated last week
- ☆13Feb 17, 2025Updated last year
- DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction☆41Apr 19, 2023Updated 2 years ago
- 2020湖南省第一届人工智能大赛参赛作品☆11Feb 17, 2022Updated 4 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 2 months ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control☆12Nov 11, 2023Updated 2 years ago
- Pytorch implementation for the paper: Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognitio…☆15Sep 19, 2024Updated last year
- A python library / model for creating co-references between AMR graph nodes.☆11Dec 11, 2022Updated 3 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- ☆14May 1, 2023Updated 2 years ago