Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
☆470Jan 18, 2023Updated 3 years ago
Alternatives and similar repositories for Video-Dataset-Loading-Pytorch
Users that are interested in Video-Dataset-Loading-Pytorch are comparing it to the libraries listed below
Sorting:
- Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.☆70Jul 5, 2022Updated 3 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆289Oct 10, 2021Updated 4 years ago
- Video datasets☆1,618Mar 8, 2023Updated 2 years ago
- Transforms for video datasets in pytorch☆277Jun 7, 2021Updated 4 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 3 years ago
- A deep learning library for video understanding research.☆3,544Jan 12, 2026Updated last month
- Video Contrastive Learning with Global Context, ICCVW 2021☆162May 30, 2022Updated 3 years ago
- GPU-accelerated video decoder☆20May 18, 2021Updated 4 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆111Jun 13, 2023Updated 2 years ago
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆2,431Jul 17, 2024Updated last year
- Implementation of ViViT: A Video Vision Transformer☆556Jun 21, 2021Updated 4 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99May 2, 2022Updated 3 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,299Feb 19, 2026Updated 2 weeks ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Aug 1, 2022Updated 3 years ago
- A dataset for multiview 3D human pose estimation with detailed occlusion labels, powered by UnrealCV☆43Oct 29, 2020Updated 5 years ago
- Multiple Object Tracking with Transformer☆673Apr 30, 2023Updated 2 years ago
- A collection of resources and papers on diffusion models of video generation.☆10Feb 11, 2023Updated 3 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,184Jul 11, 2024Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,635Mar 8, 2023Updated 2 years ago
- Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101☆972Dec 7, 2020Updated 5 years ago
- This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.☆257Oct 8, 2021Updated 4 years ago
- Effective Video Augmentation Techniques for Training Convolutional Neural Networks☆413Feb 13, 2024Updated 2 years ago
- PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.☆1,234Dec 27, 2023Updated 2 years ago
- The Holistic Video Understanding Mini Dataset☆34Apr 8, 2020Updated 5 years ago
- Official repo for Directional Self-supervised Learning for Heavy Image Augmentations [CVPR2022]☆12Jun 29, 2022Updated 3 years ago
- 3D ResNets for Action Recognition (CVPR 2018)☆4,043Jan 20, 2021Updated 5 years ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆572Jan 30, 2026Updated last month
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- Object detection achieving 44.3 mAP / 45 fps on COCO dataset☆169Oct 27, 2020Updated 5 years ago
- MAST: A Memory-Augmented Self-supervised Tracker (CVPR 2020)☆272Aug 8, 2020Updated 5 years ago
- ☆1,041Jun 28, 2020Updated 5 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆182Oct 19, 2021Updated 4 years ago
- A simple approach to enable dense segmentation with ViT.☆15Oct 26, 2021Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated 10 months ago
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"☆823Jul 14, 2022Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,683Dec 8, 2023Updated 2 years ago
- Implementations of Transformers for Video☆24Mar 26, 2021Updated 4 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,295Mar 3, 2024Updated 2 years ago