Lou1sM / video_annotationLinks
☆16Updated 2 years ago
Alternatives and similar repositories for video_annotation
Users that are interested in video_annotation are comparing it to the libraries listed below
Sorting:
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆73Updated last year
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Updated 3 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆101Updated 3 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆96Updated 2 years ago
- ☆39Updated 2 years ago
- ☆30Updated 2 years ago
- ☆16Updated 4 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆26Updated last year
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- [IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition☆32Updated last month
- Learning Situation Hyper-Graphs for Video Question Answering☆22Updated last year
- ☆105Updated 3 years ago
- ☆27Updated 3 years ago
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020☆70Updated last year
- ☆26Updated 3 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Updated 2 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆36Updated 3 years ago
- ☆10Updated 4 years ago
- ☆21Updated 3 years ago
- ☆20Updated 5 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆48Updated 2 years ago
- ☆10Updated 6 years ago
- Recent Advances in Visual Dialog☆30Updated 3 years ago
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".☆26Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆148Updated last year
- ☆28Updated 2 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆31Updated last year
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27Updated 4 years ago