facebookresearch / pytorchvideo
A deep learning library for video understanding research.
☆3,461Updated 6 months ago
Alternatives and similar repositories for pytorchvideo
Users who are interested in pytorchvideo are comparing it to the libraries listed below.
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,034Updated 8 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,618Updated 3 weeks ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,024Updated last year
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,732Updated last year
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,712Updated 11 months ago
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆2,252Updated last year
- Official DeiT repository☆4,239Updated last year
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,285Updated last year
- An end-to-end PyTorch framework for image and video classification☆1,604Updated last year
- Code release for ConvNeXt model☆6,076Updated 2 years ago
- This is an official implementation for "Video Swin Transformers".☆1,568Updated 2 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,134Updated last year
- D2Go is a toolkit for efficient deep learning☆848Updated 9 months ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,157Updated 2 weeks ago
- ☆874Updated last year
- OpenMMLab Computer Vision Foundation☆6,201Updated 3 months ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,952Updated last year
- VOLO: Vision Outlooker for Visual Recognition☆946Updated 2 years ago
- OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT…☆3,764Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,552Updated last year
- A data augmentations library for audio, image, text, and video.☆5,030Updated this week
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,510Updated 6 months ago
- End-to-End Object Detection with Transformers☆14,547Updated last year
- DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel …☆1,027Updated 2 years ago
- Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification☆719Updated 3 years ago
- OpenMMLab Pre-training Toolbox and Benchmark☆3,721Updated 9 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,273Updated 2 years ago
- CVNets: A library for training computer vision networks☆1,905Updated last year
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,961Updated last year
- Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.☆1,529Updated last year