Learning Situation Hyper-Graphs for Video Question Answering
☆23Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for SHG-VQA
Users that are interested in SHG-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆31Jun 28, 2024Updated last year
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- ☆12Dec 15, 2023Updated 2 years ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆18Oct 31, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 4 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆50Apr 27, 2025Updated last year
- A video database bridging human actions and human-object relationships☆163Jun 30, 2020Updated 5 years ago
- ☆30Dec 16, 2022Updated 3 years ago
- Exploring Large Language Models for Trajectory Prediction: A Technical Perspective☆29Jun 12, 2024Updated last year
- Video Graph Transformer for Video Question Answering (ECCV'22)☆49Jun 8, 2023Updated 2 years ago
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆53Feb 13, 2025Updated last year
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Oct 20, 2022Updated 3 years ago
- ☆37Dec 20, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)☆77Mar 26, 2025Updated last year
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆20Mar 9, 2024Updated 2 years ago
- Chain-of-Frames [CVPR 2026]☆40Jul 2, 2025Updated 10 months ago
- ☆36Apr 18, 2024Updated 2 years ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆45Mar 28, 2024Updated 2 years ago
- Code for CVPR'21 paper "Weakly Supervised Action Selection Learning in Video"☆24Apr 1, 2021Updated 5 years ago
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…☆25Sep 9, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆18Apr 30, 2025Updated last year
- ☆40Nov 29, 2022Updated 3 years ago
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.☆28Mar 19, 2025Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 6 months ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆43Sep 23, 2021Updated 4 years ago
- DASFAA 2025: Diffusion-based Hierarchical Negative Sampling for Multimodal Knowledge Graph Completion☆18Feb 17, 2025Updated last year
- ☆24Oct 8, 2023Updated 2 years ago
- a robust metric (robust fidelity) for XGNN (ICLR24)☆12Jun 3, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval☆162May 28, 2024Updated 2 years ago
- ☆81Nov 24, 2024Updated last year
- ☆22Mar 7, 2025Updated last year
- Repository for the implementation of our work on hypergraph generation as part of the ANR project "SODA".☆14Oct 27, 2025Updated 7 months ago
- PyTorch Implementation for "Eliciting Structural and Semantic Global Knowledge in Unsupervised Graph Contrastive Learning" (AAAI2023)☆25Feb 12, 2025Updated last year
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆68Oct 11, 2021Updated 4 years ago