Support, annotation, evaluation, and baseline models for the imSitu dataset.
☆60May 18, 2020Updated 5 years ago
Alternatives and similar repositories for imSitu
Users that are interested in imSitu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)☆71Mar 19, 2021Updated 5 years ago
- ☆84Apr 12, 2021Updated 5 years ago
- ☆22Dec 18, 2016Updated 9 years ago
- Visual Verb Sense Disambiguation☆13Apr 26, 2019Updated 7 years ago
- [ICCV 2019] Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations☆31Aug 6, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)☆79Oct 3, 2023Updated 2 years ago
- Predicting Hashtag from Instagram pictures using Tensorflow, TFRecords, and TF-Slim☆15Nov 7, 2016Updated 9 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.☆58Sep 12, 2019Updated 6 years ago
- ☆14Dec 9, 2023Updated 2 years ago
- Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)☆545Aug 9, 2019Updated 6 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- Pytorch Implementation of Learning Similarity between Scene Graphs and Images with Transformers (GICON))☆14Nov 9, 2023Updated 2 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 4 months ago
- ☆69Feb 25, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- code for ACL 2023 paper 'Event Extraction as Question Generation and Answering'☆24Aug 13, 2023Updated 2 years ago
- ☆13Apr 23, 2025Updated last year
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated last year
- Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)☆22Jul 12, 2018Updated 7 years ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆21Jan 30, 2021Updated 5 years ago
- A repository for converting between CoQA, SQuAD2, and QuAC and visualizing the data.☆24Dec 11, 2018Updated 7 years ago
- Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"☆16May 6, 2020Updated 5 years ago
- Large-scale city camera video dataset☆11Jul 20, 2020Updated 5 years ago
- This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…☆12Oct 1, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Jan 17, 2022Updated 4 years ago
- ☆44Mar 8, 2021Updated 5 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 7 years ago
- Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, …☆14Mar 15, 2022Updated 4 years ago
- Code for NAACL paper☆21Aug 31, 2018Updated 7 years ago
- End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos (Official Repo for SS-TAD)☆108Oct 12, 2017Updated 8 years ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆22Sep 21, 2024Updated last year
- NeurIPS 2024 (spotlight): A Textbook Remedy for Domain Shifts Knowledge Priors for Medical Image Analysis☆31Oct 15, 2024Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…☆100Jul 27, 2025Updated 9 months ago
- Scene Graph Prediction with Limited Labels