An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,439Jul 17, 2024Updated last year
Alternatives and similar repositories for decord
Users that are interested in decord are comparing it to the libraries listed below
Sorting:
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,314Updated this week
- A deep learning library for video understanding research.☆3,550Jan 12, 2026Updated 2 months ago
- Pythonic bindings for FFmpeg's libraries.☆3,141Mar 14, 2026Updated last week
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,951Updated this week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,647Updated this week
- An open-source toolbox for action understanding based on PyTorch☆1,875Apr 8, 2022Updated 3 years ago
- Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space …☆1,375Jun 10, 2024Updated last year
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,121Updated this week
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,219Dec 15, 2025Updated 3 months ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,698Dec 8, 2023Updated 2 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,504Mar 13, 2026Updated last week
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,195Jul 11, 2024Updated last year
- Python and OpenCV-based scene cut/transition detection program & library.☆4,635Mar 3, 2026Updated 2 weeks ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆3,022Feb 9, 2026Updated last month
- An open source implementation of CLIP.☆13,528Mar 12, 2026Updated last week
- This repository is intended to host tools and demos for ActivityNet☆969Mar 21, 2024Updated 2 years ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,430Feb 20, 2026Updated last month
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Mar 3, 2024Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,936Updated this week
- Easily create large video dataset from video urls☆653Jul 30, 2024Updated last year
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,838Apr 9, 2024Updated last year
- VMZ: Model Zoo for Video Modeling☆1,053Jun 17, 2025Updated 9 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,189Nov 18, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,861Feb 18, 2026Updated last month
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,986Jun 16, 2024Updated last year
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,286Jun 25, 2025Updated 8 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,777Mar 6, 2026Updated 2 weeks ago
- 3D ResNets for Action Recognition (CVPR 2018)☆4,042Jan 20, 2021Updated 5 years ago
- OpenMMLab Computer Vision Foundation☆6,415Jan 29, 2026Updated last month
- PyTorch media decoding and encoding☆1,009Updated this week
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,380Oct 19, 2025Updated 5 months ago
- Fast and memory-efficient exact attention☆22,832Updated this week
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,476Jul 3, 2024Updated last year
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,996Mar 21, 2024Updated 2 years ago
- An end-to-end PyTorch framework for image and video classification☆1,613Jun 27, 2024Updated last year
- Codebase for Image Classification Research, written in PyTorch.☆2,166Mar 20, 2024Updated 2 years ago
- A repository of common methods, datasets, and tasks for video research☆538Jun 17, 2019Updated 6 years ago
- This is an official implementation for "Video Swin Transformers".☆1,638Mar 8, 2023Updated 3 years ago
- End-to-End Object Detection with Transformers☆15,166Mar 12, 2024Updated 2 years ago