KMnP / intentonomy
π Intentonomy: towards Human Intent Understanding [CVPR 2021]
β35Updated 3 years ago
Alternatives and similar repositories for intentonomy:
Users that are interested in intentonomy are comparing it to the libraries listed below
- β34Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.β32Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioningβ34Updated 2 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022β35Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]β19Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognitionβ33Updated 3 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)β46Updated last year
- β44Updated 3 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistencyβ17Updated 3 years ago
- Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021β31Updated 2 years ago
- MIST: Multiple Instance Spatial Transformerβ25Updated 3 years ago
- Generalized Deep Metric Learning.β35Updated 2 years ago
- Transformation Driven Visual Reasoning - CVPR 2021β37Updated last year
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videosβ21Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".β31Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatioβ¦β27Updated last year
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"β57Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.β107Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", httpsβ¦β36Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoningβ64Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmarkβ51Updated last year
- A visual LLM for image region description or QA.β15Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Imagesβ58Updated 3 years ago
- Market-1501 dataset with super-resolution qualityβ18Updated 2 years ago
- CCVS: Context-aware Controllable Video Synthesisβ22Updated 3 years ago
- β18Updated 10 months ago
- β24Updated last year
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.β50Updated 2 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)β33Updated 2 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"β20Updated last year