HimangiM / RepLAI
Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTorch. (NeurIPS 2022)
☆11 Updated last year
Related projects:
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆13 Updated 5 months ago
- Download scripts for EPIC-KITCHENS ☆121 Updated last month
- ☆67 Updated 8 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) ☆27 Updated last week
- ☆19 Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos. ☆28 Updated last year
- ☆27 Updated 5 months ago
- SAAVN: code release for the paper "Sound Adversarial Audio-Visual Navigation" (ICLR 2022), in PyTorch ☆16 Updated last year
- CVPR 2022 ☆20 Updated 2 years ago
- Code implementation for the ECCV 2022 paper "My View is the Best View: Procedure Learning from Egocentric Videos" ☆24 Updated 7 months ago
- ☆23 Updated 3 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat… ☆12 Updated last year
- ☆13 Updated last month
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f… ☆22 Updated last year
- Code for Look for the Change paper published at CVPR 2022 ☆35 Updated last year
- ☆45 Updated 2 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023) ☆35 Updated 5 months ago
- Code release for "Training a Large Video Model on a Single Machine in a Day" ☆107 Updated last month
- Official codebase for EmbCLIP ☆111 Updated last year
- Video narrator written in Python/GTK using vlc-lib ☆23 Updated 2 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ☆17 Updated 5 months ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these ☆33 Updated 2 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆85 Updated 2 months ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON) ☆17 Updated 2 years ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023) ☆38 Updated last month
- ☆39 Updated 7 months ago
- PyTorch version of the TCC loss used in the paper "Temporal Cycle-Consistency Learning". ☆24 Updated 3 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding" ☆46 Updated last year
- [NeurIPS2022] Egocentric Video-Language Pretraining ☆222 Updated 4 months ago
- ☆77 Updated 2 years ago