facebookresearch / pytorchvideo
A deep learning library for video understanding research.
☆3,342Updated last week
Alternatives and similar repositories for pytorchvideo:
Users that are interested in pytorchvideo are comparing it to the libraries listed below
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆6,650Updated last week
- Code release for ConvNeXt model☆5,797Updated last year
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,259Updated 9 months ago
- Official DeiT repository☆4,074Updated 8 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,201Updated last year
- An end-to-end PyTorch framework for image and video classification☆1,596Updated 5 months ago
- OpenMMLab Computer Vision Foundation☆5,926Updated last week
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,315Updated 3 months ago
- End-to-End Object Detection with Transformers☆13,684Updated 8 months ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆6,414Updated 5 months ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,868Updated 5 months ago
- An open-source toolbox for action understanding based on PyTorch☆1,864Updated 2 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆13,991Updated 4 months ago
- Geometric Computer Vision Library for Spatial AI☆10,019Updated this week
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,072Updated 4 months ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,815Updated 2 months ago
- Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"☆2,593Updated 4 months ago
- Codebase for Image Classification Research, written in PyTorch.☆2,141Updated 8 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,341Updated this week
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆1,905Updated 4 months ago
- OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT…☆3,583Updated last year
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,569Updated 7 months ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆4,666Updated 4 months ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,887Updated 8 months ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,395Updated last year
- A data augmentations library for audio, image, text, and video.☆4,968Updated last week
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,268Updated 6 months ago
- This is an official implementation for "Video Swin Transformers".☆1,455Updated last year
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,038Updated last week
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,384Updated 4 months ago