dmlc / decordLinks
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,401Updated last year
Alternatives and similar repositories for decord
Users that are interested in decord are comparing it to the libraries listed below
Sorting:
- A deep learning library for video understanding research.☆3,538Updated 2 weeks ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,973Updated 7 months ago
- ☆929Updated last year
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,176Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,265Updated this week
- Video datasets☆1,602Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,657Updated 2 years ago
- An end-to-end PyTorch framework for image and video classification☆1,611Updated last year
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,755Updated 2 weeks ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,178Updated last month
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,218Updated 2 weeks ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,822Updated last year
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,887Updated last year
- YACS -- Yet Another Configuration System☆1,330Updated 3 years ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,987Updated last year
- EVA Series: Visual Representation Fantasies from BAAI☆2,639Updated last year
- Grounded Language-Image Pre-training☆2,566Updated 2 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,928Updated this week
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Updated last year
- TransNet V2: Shot Boundary Detection Neural Network☆842Updated 2 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,996Updated last year
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆471Updated 3 years ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,330Updated 8 months ago
- An open-source toolbox for action understanding based on PyTorch☆1,875Updated 3 years ago
- Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP☆480Updated 3 years ago
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024☆1,803Updated last month
- This is an official implementation for "Video Swin Transformers".☆1,624Updated 2 years ago
- Extract frames and motion vectors from H.264 and MPEG-4 encoded video.☆385Updated 3 months ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆1,050Updated last year
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,689Updated last week