TheShadow29 / VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
☆61 · Aug 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for VidSitu
Users who are interested in VidSitu compare it to the repositories listed below
- Condensed Movies Challenge 2021 · ☆20 · Sep 21, 2022 · Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20] · ☆194 · Sep 21, 2022 · Updated 3 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs · ☆14 · Apr 19, 2025 · Updated 9 months ago
- ☆14 · Dec 9, 2023 · Updated 2 years ago
- A video database bridging human actions and human-object relationships · ☆155 · Jun 30, 2020 · Updated 5 years ago
- Codes for ECCV paper: "Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation" · ☆16 · Jul 20, 2020 · Updated 5 years ago
- SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition, AAAI2020. · ☆14 · Dec 15, 2020 · Updated 5 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image · ☆88 · Jun 12, 2023 · Updated 2 years ago
- ☆87 · Mar 4, 2024 · Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning… · ☆723 · Aug 8, 2023 · Updated 2 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann… · ☆14 · Nov 3, 2023 · Updated 2 years ago
- MERLOT: Multimodal Neural Script Knowledge Models · ☆225 · Mar 15, 2022 · Updated 3 years ago
- Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL) · ☆69 · Mar 19, 2021 · Updated 4 years ago
- CVPR 2021 | Code to reproduce the results of the paper: A Khakzar, S Baselizadeh, S Khanduja, C Rupprecht, ST Kim, N Navab, Neural Respon… · ☆12 · Jun 23, 2021 · Updated 4 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction · ☆51 · Aug 20, 2022 · Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606) · ☆69 · Jun 10, 2020 · Updated 5 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" · ☆161 · Apr 29, 2020 · Updated 5 years ago
- ☆95 · Feb 14, 2022 · Updated 3 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Generation" · ☆201 · Apr 2, 2020 · Updated 5 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation · ☆59 · Aug 27, 2022 · Updated 3 years ago
- Code for the HowTo100M paper · ☆291 · Mar 10, 2020 · Updated 5 years ago
- Referring expression comprehension on ReferIt(RefClef) · ☆10 · Nov 28, 2016 · Updated 9 years ago
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors · ☆16 · Jun 7, 2023 · Updated 2 years ago
- Tools for movie and video research · ☆302 · Jun 20, 2022 · Updated 3 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining · ☆254 · May 9, 2024 · Updated last year
- ☆34 · Jun 2, 2023 · Updated 2 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021 · ☆66 · Oct 21, 2021 · Updated 4 years ago
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024) · ☆16 · Apr 23, 2024 · Updated last year
- Official Github repo of the VIST Challenge NAACL 2018 · ☆17 · Aug 3, 2018 · Updated 7 years ago
- ☆13 · Feb 14, 2022 · Updated 3 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency · ☆18 · Aug 10, 2022 · Updated 3 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations · ☆565 · Aug 22, 2025 · Updated 5 months ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) · ☆34 · Jul 2, 2020 · Updated 5 years ago
- awesome grounding: A curated list of research papers in visual grounding · ☆1,125 · Sep 21, 2025 · Updated 4 months ago
- ☆22 · Feb 25, 2021 · Updated 4 years ago
- ☆17 · Nov 14, 2022 · Updated 3 years ago
- ☆40 · Nov 23, 2022 · Updated 3 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M · ☆219 · Jul 5, 2022 · Updated 3 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers · ☆232 · Jun 13, 2022 · Updated 3 years ago