EGO4D / social-interactions
☆56Aug 7, 2022Updated 3 years ago
Alternatives and similar repositories for social-interactions
Users that are interested in social-interactions are comparing it to the libraries listed below
- ☆78Jan 5, 2024Updated 2 years ago
- ☆80Sep 4, 2022Updated 3 years ago
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Oct 28, 2025Updated 3 months ago
- PyTorch implementation of YOLOv3☆11Aug 30, 2018Updated 7 years ago
- ☆13Apr 23, 2025Updated 9 months ago
- ☆132May 30, 2024Updated last year
- [WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness, official implementation☆20Feb 4, 2026Updated last week
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset☆533Feb 4, 2026Updated last week
- ☆17Jul 25, 2023Updated 2 years ago
- Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3☆18Jan 14, 2023Updated 3 years ago
- ☆23Jun 12, 2023Updated 2 years ago
- We create D3D-HOI, a dataset of monocular videos with ground truth annotations of 3D object pose and part motion during human-object inter…☆95Sep 19, 2021Updated 4 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- Audio-conditioned video texture generation☆24Sep 16, 2022Updated 3 years ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆26Nov 30, 2023Updated 2 years ago
- Official code implementation of the paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆28Sep 23, 2024Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Sep 10, 2022Updated 3 years ago
- Python scripts to download Assembly101 from Google Drive☆63Oct 10, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆254May 9, 2024Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- The official code of VoteHMR☆32Jun 18, 2022Updated 3 years ago
- Source code for "Rethinking training of 3D GANs"☆31May 26, 2022Updated 3 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- The MECCANO Dataset: official repository in which we provide code and models.☆32Jul 31, 2023Updated 2 years ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆37Feb 20, 2023Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- Motion Question Answering via Modular Motion Programs☆38May 24, 2023Updated 2 years ago
- Code for Look for the Change paper published at CVPR 2022☆36Oct 26, 2022Updated 3 years ago
- VisualEchoes Dataset (ECCV 2020)☆35Aug 31, 2021Updated 4 years ago
- ☆26Sep 18, 2020Updated 5 years ago
- Hand detection models trained on 100DOH (100 Days of Hands) dataset.☆82Apr 15, 2021Updated 4 years ago
- [CVPR 2022] Egocentric Action Target Prediction in 3D☆32Dec 2, 2025Updated 2 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- ☆10Jun 26, 2024Updated last year
- ☆17Sep 10, 2025Updated 5 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆87Mar 7, 2023Updated 2 years ago