Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 2023)
☆18 · Apr 23, 2024 · Updated last year
Alternatives and similar repositories for OVQA
Users who are interested in OVQA are comparing it to the repositories listed below
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023) ☆35 · Apr 23, 2024 · Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti… ☆24 · Jan 26, 2025 · Updated last year
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main… ☆12 · Mar 10, 2025 · Updated last year
- ☆10 · Apr 19, 2024 · Updated last year
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023) ☆78 · Mar 26, 2025 · Updated 11 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1" ☆31 · Feb 22, 2026 · Updated last month
- ☆16 · Jun 5, 2023 · Updated 2 years ago
- ☆16 · Jun 5, 2023 · Updated 2 years ago
- ☆13 · Mar 28, 2025 · Updated 11 months ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025 ☆15 · Jan 15, 2025 · Updated last year
- Official Implementation (Pytorch) of "DAVI: Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems", ECCV 2024 … ☆74 · Aug 16, 2024 · Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆32 · Mar 10, 2025 · Updated last year
- Official Implementation (Pytorch) of the "LLaMo: Large Language Model-based Molecular Graph Assistant", NeurIPS 2024 ☆36 · Feb 12, 2025 · Updated last year
- Official PyTorch implementation of "Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis" (ICML 2024). ☆20 · Nov 20, 2024 · Updated last year
- Basic Artificial Intelligence Theory ☆10 · Mar 11, 2025 · Updated last year
- Official pytorch implementation of 'Relation-aware Language-Graph Transformer for Question Answering' (AAAI 2023) ☆18 · Apr 25, 2023 · Updated 2 years ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆45 · Dec 11, 2023 · Updated 2 years ago
- [NeurIPS 24] Official Implementation (Pytorch) of "Inversion-based Latent Bayesian Optimization" ☆10 · Nov 15, 2024 · Updated last year
- ☆26 · Feb 3, 2023 · Updated 3 years ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆55 · Oct 21, 2025 · Updated 5 months ago
- [CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality" ☆118 · Apr 22, 2025 · Updated 11 months ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023) ☆31 · Jun 28, 2024 · Updated last year
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection". ☆45 · Sep 12, 2024 · Updated last year
- Learning Situation Hyper-Graphs for Video Question Answering ☆22 · Feb 16, 2024 · Updated 2 years ago
- An experiment with movie scenes and contrastive learning ☆11 · Feb 1, 2025 · Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023 ☆55 · Aug 19, 2023 · Updated 2 years ago
- This repo contains code for Invariant Grounding for Video Question Answering ☆27 · Mar 2, 2023 · Updated 3 years ago
- ☆80 · Nov 24, 2024 · Updated last year
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding ☆19 · Mar 2, 2025 · Updated last year
- KV cache compression via sparse coding ☆17 · Oct 26, 2025 · Updated 4 months ago
- ☆13 · May 15, 2025 · Updated 10 months ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion ☆14 · Jul 26, 2023 · Updated 2 years ago
- ☆19 · Jun 10, 2025 · Updated 9 months ago
- ☆10 · Jun 28, 2023 · Updated 2 years ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows" ☆14 · Nov 28, 2022 · Updated 3 years ago
- ☆12 · Dec 15, 2023 · Updated 2 years ago
- ☆10 · Jan 8, 2020 · Updated 6 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆23 · Dec 4, 2024 · Updated last year
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method. ☆33 · Jul 12, 2023 · Updated 2 years ago