☆129 · Jun 27, 2021 · Updated 4 years ago
Alternatives and similar repositories for annotations
Users who are interested in annotations are comparing it to the repositories listed below.
- Identifying Visible Actions in Lifestyle Vlogs ☆15 · Aug 3, 2023 · Updated 2 years ago
- ☆95 · Feb 14, 2022 · Updated 4 years ago
- ☆48 · Apr 27, 2020 · Updated 5 years ago
- Code for the HowTo100M paper ☆293 · Mar 10, 2020 · Updated 5 years ago
- ☆148 · Mar 4, 2019 · Updated 6 years ago
- 🍴 Annotations for the EPIC KITCHENS-55 Dataset. ☆155 · Mar 17, 2021 · Updated 4 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos" ☆34 · Jan 6, 2019 · Updated 7 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE ☆200 · Jul 3, 2020 · Updated 5 years ago
- ☆80 · Sep 4, 2022 · Updated 3 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022 ☆11 · Mar 23, 2022 · Updated 3 years ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017 ☆14 · Aug 7, 2018 · Updated 7 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities? ☆12 · Dec 22, 2020 · Updated 5 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025) ☆21 · Jul 16, 2025 · Updated 7 months ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos' ☆23 · May 17, 2021 · Updated 4 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch ☆112 · Jan 25, 2021 · Updated 5 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation) ☆73 · Mar 11, 2021 · Updated 4 years ago
- Long-Term Feature Banks for Detailed Video Understanding ☆384 · Aug 30, 2021 · Updated 4 years ago
- ☆54 · Jan 21, 2023 · Updated 3 years ago
- Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization (CVPR2019) ☆152 · Mar 24, 2023 · Updated 2 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning ☆108 · Dec 18, 2020 · Updated 5 years ago
- A large (>5k) collection of search questions asked about Coronavirus 🦠 ☆14 · Mar 21, 2020 · Updated 5 years ago
- ☆252 · Nov 13, 2023 · Updated 2 years ago
- In-the-wild Question Answering ☆15 · May 10, 2023 · Updated 2 years ago
- Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020) ☆47 · Dec 2, 2020 · Updated 5 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction" ☆48 · Jun 22, 2024 · Updated last year
- Annotations for the public release of the EPIC-KITCHENS-100 dataset ☆165 · Aug 1, 2022 · Updated 3 years ago
- CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement ☆74 · Oct 8, 2021 · Updated 4 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding ☆64 · Mar 9, 2022 · Updated 3 years ago
- Data Release for VALUE Benchmark ☆30 · Feb 16, 2022 · Updated 4 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks' ☆148 · Aug 25, 2023 · Updated 2 years ago
- RareAct: A video dataset of unusual interactions ☆33 · Aug 4, 2020 · Updated 5 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos ☆71 · Sep 7, 2021 · Updated 4 years ago
- A Dataset for Grounded Video Description ☆163 · Jan 4, 2022 · Updated 4 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation ☆87 · Dec 15, 2020 · Updated 5 years ago
- Code for Learning to Learn Language from Narrated Video ☆33 · Oct 3, 2023 · Updated 2 years ago
- The Holistic Video Understanding Mini Dataset ☆34 · Apr 8, 2020 · Updated 5 years ago
- Code for our ICML 2019 paper "Temporal Gaussian Mixture Layer for Videos" ☆102 · Oct 7, 2019 · Updated 6 years ago
- ☆19 · May 2, 2020 · Updated 5 years ago
- HACS: Human Action Clips and Segments Dataset ☆197 · Apr 23, 2020 · Updated 5 years ago