Lou1sM / video_annotationLinks
☆16Updated 2 years ago
Alternatives and similar repositories for video_annotation
Users that are interested in video_annotation are comparing it to the libraries listed below
Sorting:
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Updated last year
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Updated 4 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆108Updated 3 years ago
- ☆27Updated 4 years ago
- ☆16Updated 4 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆99Updated 2 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 3 years ago
- ☆40Updated 3 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Updated last year
- ☆30Updated 3 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Updated last year
- [SIGIR 2022] Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion☆203Updated 7 months ago
- ☆107Updated 3 years ago
- ☆21Updated 3 years ago
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".☆26Updated last year
- ☆22Updated 5 years ago
- Code for our paper `Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation`☆20Updated last year
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020☆70Updated last year
- ☆29Updated 2 years ago
- [IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition☆35Updated 5 months ago
- 东南大学多模态知识图谱-OpenRichpedia工程文件☆29Updated 4 years ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆45Updated last year
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- Recent Advances in Visual Dialog☆30Updated 3 years ago
- ☆10Updated 4 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆20Updated 5 years ago
- ☆10Updated 6 years ago
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆19Updated 4 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆40Updated 3 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆49Updated 2 years ago