Lou1sM / video_annotation
☆16Updated 2 years ago
Alternatives and similar repositories for video_annotation
Users that are interested in video_annotation are comparing it to the libraries listed below
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Updated last year
- ☆40Updated 2 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Updated 4 years ago
- ☆16Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 3 years ago
- ☆106Updated 3 years ago
- ☆30Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆98Updated 2 years ago
- ☆26Updated 4 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆102Updated 3 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Updated last year
- ☆22Updated 5 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Updated last year
- ☆21Updated 3 years ago
- [IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition☆35Updated 3 months ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆48Updated 2 years ago
- This is a code repository of Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering☆19Updated 4 years ago
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".☆26Updated last year
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Updated 3 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers"☆43Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020☆70Updated last year
- ☆25Updated 3 years ago
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆18Updated 4 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- Southeast University multimodal knowledge graph: OpenRichpedia project files☆29Updated 4 years ago
- PyTorch Implementation of MUCKO (2020 IJCAI)☆20Updated 5 years ago
- Recent Advances in Visual Dialog☆30Updated 3 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆37Updated 3 years ago
- ☆10Updated 4 years ago