holistic-video-understanding / HVU-Downloader
HVU Downloader tool
☆17Updated 3 years ago
Related projects: ⓘ
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆70Updated 3 years ago
- ☆66Updated last year
- ☆74Updated 2 years ago
- Datasets, transforms and samplers for video in PyTorch☆86Updated 11 months ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆55Updated last year
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆213Updated 2 years ago
- A Dataset for Grounded Video Description☆158Updated 2 years ago
- ☆85Updated 2 years ago
- The Holistic Video Understanding Mini Dataset☆34Updated 4 years ago
- Audio Visual Instance Discrimination with Cross-Modal Agreement☆127Updated 3 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated last year
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆59Updated 3 years ago
- Mini-Kinetics-200 data splits used in paper "Rethinking Spatiotemporal Feature Learning For Video Understanding"☆80Updated 6 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆62Updated 2 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆98Updated 3 years ago
- HACS: Human Action Clips and Segments Dataset☆187Updated 4 years ago
- Feature Extractor module for videos using the PySlowFast framework☆76Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- Code for the HowTo100M paper☆250Updated 4 years ago
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Updated 2 years ago
- Code for our ICML 2019 paper "Temporal Gaussian Mixture Layer for Videos"☆101Updated 4 years ago
- Moments Retrieval Project Webpage (temporal)☆29Updated 8 months ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆158Updated 4 years ago
- This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters…☆114Updated 3 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆219Updated 2 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 3 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated 11 months ago
- ☆33Updated 5 years ago
- Starter Code for VALUE benchmark☆79Updated 2 years ago
- A PyTorch implementation of VIOLET☆136Updated 9 months ago