KMnP / intentonomyLinks
π Intentonomy: towards Human Intent Understanding [CVPR 2021]
β37Updated 3 years ago
Alternatives and similar repositories for intentonomy
Users that are interested in intentonomy are comparing it to the libraries listed below
Sorting:
- The 1st place solution of 2022 Ego4d Natural Language Queries.β32Updated 2 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022β35Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", httpsβ¦β36Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoningβ63Updated 2 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)β48Updated last year
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videosβ23Updated 2 years ago
- β35Updated 3 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMsβ26Updated 5 months ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)β35Updated 2 years ago
- β44Updated 4 years ago
- β29Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognitionβ33Updated 3 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoningβ70Updated 2 years ago
- β19Updated last month
- PyTorch code and pretrained weights for the UNIC models.β33Updated 9 months ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)β51Updated 3 years ago
- [ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.β43Updated 3 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understandingβ¦β20Updated last year
- β29Updated 2 years ago
- Generalized Deep Metric Learning.β36Updated 3 years ago
- Market-1501 dataset with super-resolution qualityβ20Updated 3 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understandingβ76Updated last year
- Unifying Specialized Visual Encoders for Video Language Modelsβ21Updated last week
- β26Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]β19Updated 2 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"β20Updated 2 years ago
- β62Updated last year
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"β60Updated 2 years ago
- Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021β32Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmarkβ54Updated 2 years ago