Tushar-N / attributes-as-operators
Attribute-Object Visual Composition using Attributes as Operators
☆64 · Updated last year
Related projects
Alternatives and complementary repositories for attributes-as-operators
- Code for Learning to Learn Language from Narrated Video ☆33 · Updated last year
- ☆86 · Updated 2 years ago
- RareAct: A video dataset of unusual interactions ☆32 · Updated 4 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured… ☆58 · Updated 3 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071… ☆69 · Updated 4 years ago
- ☆11 · Updated 7 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los… ☆30 · Updated 4 years ago
- Multi-sense word embeddings from visual co-occurrences ☆25 · Updated 5 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog ☆44 · Updated 4 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction" ☆44 · Updated 4 months ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing ☆35 · Updated last year
- Scene Graph Prediction with Limited Labels ☆54 · Updated last year
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606) ☆67 · Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos' ☆22 · Updated 3 years ago
- Code for the Globetrotter project ☆23 · Updated 2 years ago
- NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering ☆59 · Updated 3 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning ☆103 · Updated 3 years ago
- Official code for NeurIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D" ☆26 · Updated last year
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆33 · Updated 5 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision ☆44 · Updated 4 years ago
- ☆74 · Updated 2 years ago
- A paper list of visual semantic embeddings and text-image retrieval. ☆41 · Updated 3 years ago
- Creativity Inspired Zero-Shot Learning ☆31 · Updated 3 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language". ☆62 · Updated 4 years ago
- ☆54 · Updated 4 years ago
- Video Noise Contrastive Estimation ☆65 · Updated last year
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020) ☆33 · Updated 2 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019). ☆55 · Updated 4 months ago