Lou1sM / video_annotation
☆17Updated last year
Alternatives and similar repositories for video_annotation:
Users that are interested in video_annotation are comparing it to the libraries listed below
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆71Updated last year
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Updated 3 years ago
- ☆38Updated 2 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Updated last year
- Learning Situation Hyper-Graphs for Video Question Answering☆20Updated last year
- ☆28Updated 2 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆96Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆94Updated 2 years ago
- ☆20Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- ☆10Updated 5 years ago
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)☆72Updated last year
- ☆102Updated 3 years ago
- 东南大学多模态知识图 谱-OpenRichpedia工程文件☆29Updated 3 years ago
- Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.☆88Updated 2 years ago
- ☆16Updated 3 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆27Updated 2 years ago
- [SIGIR 2022] Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion☆183Updated last week
- ☆12Updated last year
- Video Graph Transformer for Video Question Answering (ECCV'22)☆47Updated last year
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27Updated 3 years ago
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆110Updated last month
- A Few-Shot Learning based Approach to Multimodal Social Relation Extraction☆13Updated 2 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆20Updated 4 years ago
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22))☆19Updated 11 months ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆25Updated 2 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Updated 2 years ago
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆55Updated 3 years ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆34Updated 10 months ago