Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)
☆71Mar 19, 2021Updated 5 years ago
Alternatives and similar repositories for swig
Users that are interested in swig are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Support, annotation, evaluation, and baseline models for the imSitu dataset.☆60May 18, 2020Updated 5 years ago
- [CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for Grounded Situation Recognition"☆51Apr 9, 2023Updated 3 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration☆80Mar 5, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A strong HOI Detection model without Frills!☆59May 12, 2019Updated 6 years ago
- GTNet:Guided Transformer Network for Detecting Human-Object Interactions☆15Apr 4, 2023Updated 3 years ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these☆41Oct 26, 2024Updated last year
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Tools for the Parse-27k Dataset - evaluation routines and some simple scripts to get started...☆11Jul 16, 2016Updated 9 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- Pre-trained V+L Data Preparation☆46Jun 2, 2020Updated 5 years ago
- ☆201May 10, 2023Updated 2 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Dialog State Tracking with Deep Neural Networks☆18Apr 13, 2015Updated 11 years ago
- ☆10Jul 5, 2024Updated last year
- [ICCV'21] Official PyTorch implementation for paper "Spatially Conditioned Graphs for Detecting Human–Object Interactions"☆67May 13, 2022Updated 3 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆139Updated this week
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- Code for paper "Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification"☆16Jul 4, 2023Updated 2 years ago
- PIC Challenge Baseline☆18Dec 27, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆35Jul 2, 2025Updated 9 months ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Oct 10, 2022Updated 3 years ago
- ☆12Jul 20, 2024Updated last year
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured…☆60Oct 12, 2021Updated 4 years ago
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆24Sep 17, 2022Updated 3 years ago
- SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition, AAAI2020.☆14Dec 15, 2020Updated 5 years ago
- My personal CV☆12Aug 12, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago
- Code for "Continual Learning of Object Instances", Implemented in PyTorch, https://arxiv.org/abs/2004.10862☆11Jun 12, 2020Updated 5 years ago
- (ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Jan 28, 2024Updated 2 years ago
- ☆18Apr 27, 2019Updated 6 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆55Jun 25, 2025Updated 9 months ago
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆106Jul 18, 2024Updated last year