holistic-video-understanding / HVU-DownloaderLinks
HVU Downloader tool
☆17Updated 4 years ago
Alternatives and similar repositories for HVU-Downloader
Users that are interested in HVU-Downloader are comparing it to the libraries listed below
Sorting:
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Updated 4 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆218Updated 2 years ago
- ☆91Updated 3 years ago
- Code for our ICML 2019 paper "Temporal Gaussian Mixture Layer for Videos"☆101Updated 5 years ago
- A Dataset for Grounded Video Description☆162Updated 3 years ago
- The Holistic Video Understanding Mini Dataset☆34Updated 5 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated last year
- Moments Retrieval Project Webpage (temporal)☆31Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆44Updated 4 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 5 years ago
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Updated 3 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch☆112Updated 4 years ago
- Feature Extractor module for videos using the PySlowFast framework☆79Updated 4 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100Updated 4 years ago
- Audio Visual Instance Discrimination with Cross-Modal Agreement☆129Updated 3 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆162Updated 5 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆222Updated 2 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆195Updated 4 years ago
- Mixture-of-Embeddings-Experts☆119Updated 4 years ago
- ☆43Updated 4 years ago
- ☆73Updated 3 years ago
- Kernel Temporal Segmentation☆55Updated 6 years ago
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆197Updated 4 years ago
- HACS: Human Action Clips and Segments Dataset☆193Updated 5 years ago
- Code for the paper: "Sentence Specified Dynamic Video Thumbnail Generation"☆33Updated 5 years ago