☆95Feb 14, 2022Updated 4 years ago
Alternatives and similar repositories for CrossTask
Users that are interested in CrossTask are comparing it to the libraries listed below
Sorting:
- Code for the HowTo100M paper☆293Mar 10, 2020Updated 5 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- ☆19May 2, 2020Updated 5 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016☆20Jul 27, 2016Updated 9 years ago
- ☆129Jun 27, 2021Updated 4 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆219Jul 5, 2022Updated 3 years ago
- Weakly-supervised action segmentation in video☆16Feb 13, 2022Updated 4 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- ☆78Aug 16, 2021Updated 4 years ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆165Aug 1, 2022Updated 3 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆10Aug 3, 2023Updated 2 years ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆48Jun 22, 2024Updated last year
- Video narrator written in Python/GTK using vlc-lib☆25Jun 22, 2022Updated 3 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 4 years ago
- EPIC-KITCHENS-55 dataset python library☆31Jun 21, 2022Updated 3 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- EPIC-KITCHENS-55 baselines for Action Recognition☆75Jul 14, 2020Updated 5 years ago
- ☆252Nov 13, 2023Updated 2 years ago
- 🍴 Annotations for the EPIC KITCHENS-55 Dataset.☆155Mar 17, 2021Updated 4 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆289Oct 10, 2021Updated 4 years ago
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆20Jan 9, 2025Updated last year
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Project and dataset webpage:☆286Oct 12, 2023Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Mar 9, 2022Updated 3 years ago
- ☆18Dec 13, 2019Updated 6 years ago
- ☆48Apr 27, 2020Updated 5 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 3 years ago
- ☆33Nov 12, 2018Updated 7 years ago
- RareAct: A video dataset of unusual interactions☆33Aug 4, 2020Updated 5 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Mar 11, 2021Updated 4 years ago
- Code for Oops! Predicting Unintentional Action in Video☆80Apr 13, 2020Updated 5 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- ☆33Mar 14, 2021Updated 4 years ago
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago