kenshohara / video-recognition
☆21Updated 5 years ago
Alternatives and similar repositories for video-recognition:
Users that are interested in video-recognition are comparing it to the libraries listed below
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated last year
- Implementations of Transformers for Video☆23Updated 3 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆76Updated 4 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆131Updated 3 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 3 years ago
- Code for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Mu…☆69Updated 2 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 3 years ago
- The Holistic Video Understanding Mini Dataset☆34Updated 4 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- Epic Kitchens Object Detector and Feature Extractor using Faster-RCNN with Detectron2☆22Updated 4 years ago
- A Comprehensive Tutorial on Video Modeling☆66Updated 4 years ago
- Datasets, transforms and samplers for video in PyTorch☆87Updated last year
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated last year
- ☆15Updated 5 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆102Updated 4 years ago
- [AAAI 2020] Temporal Interlacing Network☆84Updated 4 years ago
- The official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Effi…☆53Updated 4 years ago
- Unofficial sample code for Distilled 3D Networks (D3D) in Tensorflow.☆47Updated 6 years ago
- ☆70Updated last year
- PyTorch implementation of X3D models with Multigrid training.☆94Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆219Updated 2 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 4 years ago
- Code for the paper "Generalizing Hand Segmentation in Egocentric Videos with Uncertainty-Guided Model Adaptation"☆36Updated 4 years ago
- SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)☆41Updated 3 years ago
- an implementation of mixup☆41Updated 4 years ago
- Mutual Modality Learning code☆15Updated 4 years ago
- ☆54Updated 3 years ago
- A unified framework to jointly model images, text, and human attention traces.☆78Updated 3 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 3 years ago