Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.
☆70Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for pytorch-VideoDataset
Users that are interested in pytorch-VideoDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆472Jan 18, 2023Updated 3 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆24Jul 12, 2019Updated 6 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- Official Implementation of "Learning Disentangled Behavior Embeddings"☆14Nov 18, 2021Updated 4 years ago
- [ICIP 2021] PyTorch code for "The Mind's Eye: Visualizing Class-Agnostic Features of CNNs" for generation of kernel features.☆12Sep 12, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Network plotting python package☆12Nov 17, 2021Updated 4 years ago
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Dec 7, 2021Updated 4 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Jan 11, 2022Updated 4 years ago
- Official implementation for TAO (CVPR 2025)☆19Jan 1, 2026Updated 4 months ago
- Implementation of ViViT: A Video Vision Transformer☆560Jun 21, 2021Updated 4 years ago
- Lazy python recipes.☆10Apr 17, 2026Updated last month
- Distributive pipelines for phenomic data analysis☆13Nov 12, 2021Updated 4 years ago
- MYNOVA-Sparkin is an open-source hardware project: a wireless Bluetooth fingerprint reader for Windows, offering 0.5s identification, loc…☆25Feb 9, 2026Updated 3 months ago
- This is a tutorial on how to create a Term-Document Matrix from Elasticsearch.☆11Jan 29, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Video dataset class for loading videos in PyTorch using Dataloader☆129Feb 8, 2020Updated 6 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- Mining (maximal) Span-cores from Temporal Networks☆13Nov 27, 2018Updated 7 years ago
- Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools an…☆13Mar 8, 2024Updated 2 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Jun 30, 2020Updated 5 years ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- ☆11Mar 26, 2019Updated 7 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- Inflated 3D ConvNets for video understanding☆49Oct 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is an implementation of Image2StyleGAN embedding algorithm and various experiments using StyleGAN2-ADA as backbone.☆17Sep 2, 2021Updated 4 years ago
- ☆11Feb 1, 2024Updated 2 years ago
- Reimplementation of Wasserstein Auto Encoder (WAE) with Wasserstein GAN based penalty D_Z in Tensorflow☆12Mar 28, 2019Updated 7 years ago
- Pytorch implementation of lfads, and hierarchical extension☆26Dec 2, 2021Updated 4 years ago
- Detection of OD and Fovea for IDRiD Diabetic Retinopathy dataset using FasterRCNN and RetinaNet. (MVA Medical Imaging class final project…☆11Apr 6, 2020Updated 6 years ago
- Official code for 'Tackling Structural Hallucination in Image Translation with Local Diffusion' (ECCV'24 Oral)☆27Sep 17, 2024Updated last year
- ☆16Sep 11, 2025Updated 8 months ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆12Aug 28, 2020Updated 5 years ago
- A head-mounted camera system integrates detailed behavioral monitoring with multichannel electrophysiology in freely moving mice☆29Jul 27, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆23Jul 16, 2025Updated 10 months ago
- ☆11May 8, 2024Updated 2 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…☆39Dec 27, 2025Updated 5 months ago
- Convolutional Restricted Boltzmann Machine☆15May 10, 2017Updated 9 years ago
- Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for video…☆29Aug 4, 2023Updated 2 years ago