Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)
☆70Mar 19, 2021Updated 5 years ago
Alternatives and similar repositories for swig
Users that are interested in swig are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Support, annotation, evaluation, and baseline models for the imSitu dataset.☆60May 18, 2020Updated 5 years ago
- PyTorch implementation for our CVPR 2020 Paper "Attention-based Context Aware Reasoning for Situation Recognition"☆19Oct 20, 2020Updated 5 years ago
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 4 years ago
- [CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for Grounded Situation Recognition"☆50Apr 9, 2023Updated 2 years ago
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers"☆27Mar 30, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- Utilities for the human-object interaction detection dataset HICO-DET☆63Dec 14, 2023Updated 2 years ago
- ☆107Apr 11, 2022Updated 3 years ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration☆80Mar 5, 2023Updated 3 years ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020☆42Jul 14, 2020Updated 5 years ago
- A strong HOI Detection model without Frills!☆59May 12, 2019Updated 6 years ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these☆41Oct 26, 2024Updated last year
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- Pre-trained V+L Data Preparation☆46Jun 2, 2020Updated 5 years ago
- ☆199May 10, 2023Updated 2 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- Dialog State Tracking with Deep Neural Networks☆18Apr 13, 2015Updated 10 years ago
- RareAct: A video dataset of unusual interactions☆34Aug 4, 2020Updated 5 years ago
- [ICCV'21] Official PyTorch implementation for paper "Spatially Conditioned Graphs for Detecting Human–Object Interactions"☆67May 13, 2022Updated 3 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆137Sep 29, 2024Updated last year
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- PIC Challenge Baseline☆18Dec 27, 2018Updated 7 years ago
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆35Jul 2, 2025Updated 8 months ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Oct 10, 2022Updated 3 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured…☆60Oct 12, 2021Updated 4 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27May 26, 2023Updated 2 years ago
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆24Sep 17, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆73Nov 7, 2022Updated 3 years ago
- Code for "Continual Learning of Object Instances", Implemented in PyTorch, https://arxiv.org/abs/2004.10862☆11Jun 12, 2020Updated 5 years ago
- COCO API Customized for OVIS evaluation☆17Nov 8, 2021Updated 4 years ago
- Hooks for VCOCO☆164Jun 16, 2017Updated 8 years ago
- (ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Jan 28, 2024Updated 2 years ago
- ECCV2020: Visual Compositional Learning for Human-Object Interaction Detection☆35Apr 23, 2021Updated 4 years ago