[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
☆14Jan 13, 2025Updated last year
Alternatives and similar repositories for vitis
Users that are interested in vitis are comparing it to the libraries listed below
Sorting:
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆19Aug 5, 2025Updated 6 months ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago
- ☆14Apr 1, 2023Updated 2 years ago
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆13Jan 10, 2024Updated 2 years ago
- ☆12Mar 30, 2023Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆31Jun 28, 2024Updated last year
- ☆18Feb 20, 2025Updated last year
- TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition☆17Mar 26, 2024Updated last year
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval☆13Nov 5, 2023Updated 2 years ago
- [ICCV'23] PAINet: Parallel Attention Interaction Network for Few-shot Skeleton-based Action Recognition☆11Oct 14, 2023Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- ☆15Aug 4, 2025Updated 6 months ago
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- Repo of NeurIPS23☆18Oct 25, 2023Updated 2 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- ☆17Jun 15, 2022Updated 3 years ago
- Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…☆18Jun 22, 2021Updated 4 years ago
- [NeurIPS 2023] Meta-Adapter☆48Nov 21, 2023Updated 2 years ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- ☆21May 11, 2025Updated 9 months ago
- Implementing ONNX runtime for android to run Segment Anything Model 2☆12Aug 1, 2025Updated 7 months ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆30Apr 4, 2022Updated 3 years ago
- ☆33Mar 7, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆77Mar 7, 2024Updated last year
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆27Jan 3, 2023Updated 3 years ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Nov 2, 2023Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆83Jul 1, 2024Updated last year
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- Pytorch implementation for the paper: Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognitio…☆15Sep 19, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- ☆16Oct 9, 2024Updated last year
- CVPR 2024 Official Repository☆12Mar 27, 2024Updated last year
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 4 months ago