holistic-video-understanding / HVU-DownloaderLinks
HVU Downloader tool
☆16Updated 5 years ago
Alternatives and similar repositories for HVU-Downloader
Users that are interested in HVU-Downloader are comparing it to the libraries listed below
Sorting:
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆220Updated 3 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Updated 4 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆223Updated 3 years ago
- A Dataset for Grounded Video Description☆162Updated 3 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆198Updated 5 years ago
- HACS: Human Action Clips and Segments Dataset☆195Updated 5 years ago
- Code for the HowTo100M paper☆283Updated 5 years ago
- ☆93Updated 3 years ago
- Transforms for video datasets in pytorch☆276Updated 4 years ago
- Mini-Kinetics-200 data splits used in paper "Rethinking Spatiotemporal Feature Learning For Video Understanding"☆80Updated 7 years ago
- Feature Extractor module for videos using the PySlowFast framework☆79Updated 4 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆288Updated 4 years ago
- Long-Term Feature Banks for Detailed Video Understanding☆384Updated 4 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch☆113Updated 4 years ago
- Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer☆152Updated 4 years ago
- ☆191Updated 4 months ago
- Video embeddings for retrieval with natural language queries☆342Updated 2 years ago
- ☆69Updated 2 years ago
- PyTorch implementation of X3D models with Multigrid training.☆96Updated 4 years ago
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Updated 3 years ago
- Diagnosing Error in Temporal Action Detectors (ECCV 2018)☆75Updated 3 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- Code for our ICML 2019 paper "Temporal Gaussian Mixture Layer for Videos"☆102Updated 6 years ago
- A PyTorch implementation of VIOLET☆139Updated last year
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆162Updated 5 years ago
- Kernel Temporal Segmentation☆57Updated 6 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Updated 2 years ago
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.☆252Updated 4 years ago
- [NeurIPS 2019] Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition☆85Updated last year
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Updated 4 years ago