KMnP / intentonomy
π Intentonomy: towards Human Intent Understanding [CVPR 2021]
β36Updated 3 years ago
Alternatives and similar repositories for intentonomy:
Users that are interested in intentonomy are comparing it to the libraries listed below
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoningβ63Updated 2 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)β46Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022β35Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.β32Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioningβ34Updated 2 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariancesβ¦β47Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognitionβ33Updated 3 years ago
- MIST: Multiple Instance Spatial Transformerβ25Updated 3 years ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)β51Updated 2 years ago
- β54Updated 2 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoningβ66Updated 2 years ago
- β44Updated 3 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistantβ23Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)β35Updated 2 years ago
- Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021β32Updated 2 years ago
- β29Updated last year
- Market-1501 dataset with super-resolution qualityβ19Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)β18Updated last year
- RareAct: A video dataset of unusual interactionsβ32Updated 4 years ago
- Transformation Driven Visual Reasoning - CVPR 2021β37Updated last year
- (NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"β31Updated 3 years ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?β21Updated 6 months ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.β107Updated last year
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapesβ19Updated last year
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistencyβ17Updated 3 years ago
- β11Updated 3 years ago
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in thβ¦β63Updated 2 years ago
- β73Updated 2 years ago
- Generalized Deep Metric Learning.β35Updated 3 years ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations whβ¦β24Updated 3 years ago