hadjisma / VideoAlignment
☆54Updated 3 years ago
Alternatives and similar repositories for VideoAlignment:
Users that are interested in VideoAlignment are comparing it to the libraries listed below
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆61Updated 3 years ago
- ☆33Updated 3 years ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆90Updated last year
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- ☆67Updated last year
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆152Updated 3 years ago
- ☆28Updated 2 years ago
- Official Implementation for "Fast Weakly Supervised Action Segmentation Using Mutual Consistency" - TPAMI 2021☆20Updated 3 years ago
- ☆33Updated 2 years ago
- ☆44Updated 3 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆32Updated 2 years ago
- Code for ''Alleviating Over-segmentation Errors by Detecting Action Boundaries'' accepted in WACV2021☆58Updated last year
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- ☆17Updated 4 years ago
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated 2 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆33Updated 3 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆115Updated last year
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation☆84Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Implementation of paper "Modeling Multi-Label Action Dependencies for Temporal Action Localization"☆50Updated last year
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆47Updated 3 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated 2 years ago
- Download scripts for EPIC-KITCHENS☆129Updated 7 months ago
- ☆83Updated last year
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Updated 2 years ago
- [WACV2021] Implementation of Pyramid Dilated Attention Network (PDAN)☆19Updated 2 years ago
- Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition☆100Updated 4 years ago
- Reducing spatial redundancy in video recognition. SOTA computational efficiency.☆124Updated 2 months ago
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆164Updated 3 years ago