Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)
☆70Mar 19, 2021Updated 4 years ago
Alternatives and similar repositories for swig
Users that are interested in swig are comparing it to the libraries listed below
- Support, annotation, evaluation, and baseline models for the imSitu dataset.☆60May 18, 2020Updated 5 years ago
- PyTorch implementation for our CVPR 2020 Paper "Attention-based Context Aware Reasoning for Situation Recognition"☆20Oct 20, 2020Updated 5 years ago
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 3 years ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- ☆10Jul 5, 2024Updated last year
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020☆42Jul 14, 2020Updated 5 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆137Sep 29, 2024Updated last year
- ☆107Apr 11, 2022Updated 3 years ago
- ☆13Apr 23, 2025Updated 10 months ago
- (ACM MM24) This is the official repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Jan 28, 2024Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration☆80Mar 5, 2023Updated 3 years ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these☆39Oct 26, 2024Updated last year
- COCO API Customized for OVIS evaluation☆16Nov 8, 2021Updated 4 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 23 Spotlight☆37May 23, 2023Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆33Aug 4, 2020Updated 5 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 10 months ago
- Code for paper "Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification"☆16Jul 4, 2023Updated 2 years ago
- Java/python library and validator for the AIDA Interchange Format (AIF). Originally based on isi-vista/gaia-interchange.☆21Jun 14, 2023Updated 2 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Jan 15, 2021Updated 5 years ago
- Dialog State Tracking Challenge viewer and tracker☆15Nov 19, 2016Updated 9 years ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆40Feb 27, 2026Updated last week
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025]☆27Aug 4, 2025Updated 7 months ago
- [ICML 2025] Official implementation of Spherical Diffusion Policy: A SE(3) Equivariant Visuomotor Policy with Spherical Fourier Represent…☆39Jul 8, 2025Updated 7 months ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago
- A strong HOI Detection model without Frills!☆59May 12, 2019Updated 6 years ago
- A paper list that includes world models or generative video models for embodied agents.☆26Jan 17, 2025Updated last year
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆18Jan 2, 2021Updated 5 years ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Oct 10, 2022Updated 3 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆50Dec 18, 2023Updated 2 years ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Feb 6, 2023Updated 3 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- ☆23May 22, 2024Updated last year
- ☆96Feb 14, 2022Updated 4 years ago
- Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurIPS 2019)"☆47Jul 27, 2020Updated 5 years ago
- baseline model for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- PIC Challenge Baseline☆18Dec 27, 2018Updated 7 years ago