KMnP / intentonomy
Intentonomy: towards Human Intent Understanding [CVPR 2021]
☆33 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for intentonomy
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning ☆33 · Updated 2 years ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN) ☆52 · Updated 2 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022 ☆33 · Updated last year
- Market-1501 dataset with super-resolution quality ☆18 · Updated 2 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178) ☆22 · Updated 3 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition". ☆32 · Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition ☆32 · Updated 3 years ago
- ☆34 · Updated 2 years ago
- ☆41 · Updated 2 years ago
- ☆44 · Updated 3 years ago
- PyTorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut" ☆56 · Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries. ☆32 · Updated 2 years ago
- This is the official PyTorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation. ☆21 · Updated 3 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022) ☆45 · Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… ☆23 · Updated last year
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency ☆17 · Updated 2 years ago
- Transformation Driven Visual Reasoning - CVPR 2021 ☆34 · Updated last year
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral. ☆17 · Updated 4 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆64 · Updated 2 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021) ☆33 · Updated 3 years ago
- Generalized Deep Metric Learning. ☆35 · Updated 2 years ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral) ☆144 · Updated 2 years ago
- ☆35 · Updated 10 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆59 · Updated last month
- Codes for our CVPR 2021 paper "Deep Compositional Metric Learning" ☆19 · Updated 3 years ago
- [NeurIPS 2021] Official Matlab implementation of LOD: Large-Scale Unsupervised Object Discovery. ☆20 · Updated 2 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆32 · Updated last year
- ☆26 · Updated 3 years ago
- ☆42 · Updated last year
- MIST: Multiple Instance Spatial Transformer ☆25 · Updated 3 years ago