dmlc / decordLinks
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,295Updated last year
Alternatives and similar repositories for decord
Users that are interested in decord are comparing it to the libraries listed below
Sorting:
- A deep learning library for video understanding research.☆3,480Updated 7 months ago
- ☆880Updated last year
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,799Updated 2 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,661Updated last month
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,139Updated last year
- Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space …☆1,354Updated last year
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,758Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,097Updated 9 months ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,173Updated 2 weeks ago
- YACS -- Yet Another Configuration System☆1,319Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,570Updated last year
- An end-to-end PyTorch framework for image and video classification☆1,609Updated last year
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,036Updated last month
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,966Updated last year
- Video datasets☆1,503Updated 2 years ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,742Updated last year
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,287Updated last year
- TransNet V2: Shot Boundary Detection Neural Network☆739Updated last year
- Grounded Language-Image Pre-training☆2,493Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,580Updated 2 years ago
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆463Updated 2 years ago
- Python and OpenCV-based scene cut/transition detection program & library.☆4,181Updated last week
- An open-source toolbox for action understanding based on PyTorch☆1,874Updated 3 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,857Updated this week
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆772Updated 2 years ago
- VMZ: Model Zoo for Video Modeling☆1,051Updated 2 months ago
- Convolutional neural network model for video classification trained on the Kinetics dataset.☆1,804Updated 6 years ago
- VideoX: a collection of video cross-modal models☆1,043Updated last year
- EVA Series: Visual Representation Fantasies from BAAI☆2,560Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,110Updated 3 months ago