dmlc / decordLinks
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,413Updated last year
Alternatives and similar repositories for decord
Users that are interested in decord are comparing it to the libraries listed below
Sorting:
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,981Updated this week
- A deep learning library for video understanding research.☆3,541Updated 3 weeks ago
- ☆932Updated last year
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,192Updated last month
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,177Updated last year
- Video datasets☆1,606Updated 2 years ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,823Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,674Updated 2 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,762Updated this week
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,222Updated 3 weeks ago
- An end-to-end PyTorch framework for image and video classification☆1,612Updated last year
- TransNet V2: Shot Boundary Detection Neural Network☆845Updated 2 years ago
- This is an official implementation for "Video Swin Transformers".☆1,629Updated 2 years ago
- PyTorch media decoding and encoding☆940Updated this week
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆748Updated last year
- YACS -- Yet Another Configuration System☆1,331Updated 3 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Updated last year
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆471Updated 3 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,281Updated this week
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,990Updated last year
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,613Updated last week
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,997Updated last year
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,913Updated last year
- Easily create large video dataset from video urls☆648Updated last year
- Grounded Language-Image Pre-training☆2,570Updated 2 years ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆1,052Updated last year
- EVA Series: Visual Representation Fantasies from BAAI☆2,643Updated last year
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,810Updated 2 months ago
- Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification☆727Updated 4 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆644Updated last week