Tools for loading video datasets and applying video transforms in PyTorch. Video files can be loaded directly, without preprocessing.
☆70Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for pytorch-VideoDataset
Users that are interested in pytorch-VideoDataset are comparing it to the libraries listed below.
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆470Jan 18, 2023Updated 3 years ago
- Deep Neural Networks for Video Classification☆46Nov 22, 2022Updated 3 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆24Jul 12, 2019Updated 6 years ago
- CVPR 2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- I3D implementation in Keras + video preprocessing + visualization of results☆42Sep 10, 2023Updated 2 years ago
- [ICIP 2021] PyTorch code for "The Mind's Eye: Visualizing Class-Agnostic Features of CNNs" for generation of kernel features.☆12Sep 12, 2021Updated 4 years ago
- Network plotting python package☆12Nov 17, 2021Updated 4 years ago
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Dec 7, 2021Updated 4 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Jan 11, 2022Updated 4 years ago
- Official implementation for TAO (CVPR 2025)☆20Jan 1, 2026Updated 5 months ago
- Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101☆972Dec 7, 2020Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer☆560Jun 21, 2021Updated 4 years ago
- ☆15Jul 9, 2019Updated 6 years ago
- Distributive pipelines for phenomic data analysis☆13Nov 12, 2021Updated 4 years ago
- Gaze estimation from 2D image☆13Dec 17, 2024Updated last year
- A reinforcement learning agent playing as the turret, where its goal is to allow ten friendly units to enter the base, and loses if an en…☆14Dec 24, 2020Updated 5 years ago
- Video dataset class for loading videos in PyTorch using Dataloader☆127Feb 8, 2020Updated 6 years ago
- Keras implementation of video classifier☆111Jul 23, 2018Updated 7 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- ☆13Nov 15, 2024Updated last year
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 5 years ago
- Tensorflow implementation of Unsupervised learning of object landmarks by factorized spatial embeddings☆30Feb 19, 2019Updated 7 years ago
- This repo is for me to break down the details of the paper The Unreasonable Effectiveness of Deep Features as a Perceptual Metric☆18Feb 5, 2018Updated 8 years ago
- Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools an…☆13Mar 8, 2024Updated 2 years ago
- Use Claude Code in OpenCode - created from idea https://github.com/anomalyco/opencode/issues/9677☆77Apr 26, 2026Updated last month
- ☆28Aug 9, 2025Updated 10 months ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- ☆15Dec 16, 2023Updated 2 years ago
- Inflated 3D ConvNets for video understanding☆49Oct 5, 2023Updated 2 years ago
- ☆11Feb 1, 2024Updated 2 years ago
- Reimplementation of Wasserstein Auto Encoder (WAE) with Wasserstein GAN based penalty D_Z in Tensorflow☆12Mar 28, 2019Updated 7 years ago
- Detection of OD and Fovea for IDRiD Diabetic Retinopathy dataset using FasterRCNN and RetinaNet. (MVA Medical Imaging class final project…☆11Apr 6, 2020Updated 6 years ago
- Video classification tools using 3D ResNet☆1,131Nov 23, 2018Updated 7 years ago
- ☆17Sep 11, 2025Updated 9 months ago
- CenterNet improved with an HRNet-based backbone☆23Aug 14, 2019Updated 6 years ago
- ☆24Sep 2, 2022Updated 3 years ago
- Training and evaluating self-supervised deep neural networks☆27Sep 10, 2017Updated 8 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆26Jul 16, 2025Updated 11 months ago