StanLei52 / TQVSRLinks

[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant

☆23

Alternatives and similar repositories for TQVSR

Users that are interested in TQVSR are comparing it to the libraries listed below

Sorting:

TheShadow29 / VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
☆61Updated 3 years ago
LisaAnne / TemporalLanguageRelease
☆43Updated 4 years ago
salesforce / paprika
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆50Updated 6 months ago
antoine77340 / RareAct
RareAct: A video dataset of unusual interactions
☆32Updated 5 years ago
airsplay / vimpac
☆73Updated 3 years ago
zinengtang / DeCEMBERT
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Updated 2 years ago
facebookresearch / video-distant-supervision
This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…
☆43Updated 2 years ago
MikeWangWZHL / VidIL
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
☆115Updated 2 years ago
microsoft / LAVENDER
A Unified Framework for Video-Language Understanding
☆57Updated 2 years ago
antoyang / just-ask
[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
☆123Updated last year
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
leonnnop / VAR
[CVPR 2022] Visual Abductive Reasoning
☆122Updated 9 months ago
guilk / VLC
Research code for "Training Vision-Language Transformers from Captions Alone"
☆34Updated 3 years ago
NVlabs / Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
☆71Updated 2 years ago
tsujuifu / pytorch_empirical-mvm
A PyTorch implementation of EmpiricalMVM
☆41Updated last year
IIGROUP / PUM
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19Updated 4 years ago
facebookresearch / ProcedureVRL
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆54Updated 2 years ago
microsoft / UniTAB
UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)
☆87Updated 2 years ago
StanLei52 / GEBD
[ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation
☆69Updated 3 years ago
alirezazareian / vspnet
Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing
☆35Updated 2 years ago
jayleicn / singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆135Updated 2 years ago
soCzech / LookForTheChange
Code for Look for the Change paper published at CVPR 2022
☆36Updated 2 years ago
Vision-CAIR / LTVRR
☆35Updated last year
cambridgeltl / visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
☆128Updated 2 years ago
NNNNAI / Ego4d_NLQ_2022_1st_Place_Solution
The 1st place solution of 2022 Ego4d Natural Language Queries.
☆32Updated 2 years ago
chaoyuaw / lvu
☆85Updated last year
Chuhanxx / Temporal_Query_Networks
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆62Updated 3 years ago
klauscc / VindLU
☆108Updated 2 years ago
medhini / clip_it
CLIP-It! Language-Guided Video Summarization
☆75Updated 4 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Updated 2 years ago