facebookresearch / pytorchvideo

A deep learning library for video understanding research.

☆3,461

Alternatives and similar repositories for pytorchvideo

Users who are interested in pytorchvideo are comparing it to the libraries listed below.

facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,034 Updated 8 months ago
google-research / scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
☆3,618 Updated 3 weeks ago
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,024 Updated last year
facebookresearch / TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,732 Updated last year
open-mmlab / mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆4,712 Updated 11 months ago
dmlc / decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,252 Updated last year
facebookresearch / deit
Official DeiT repository
☆4,239 Updated last year
facebookresearch / vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,285 Updated last year
facebookresearch / ClassyVision
An end-to-end PyTorch framework for image and video classification
☆1,604 Updated last year
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,076 Updated 2 years ago
SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,568 Updated 2 years ago
mit-han-lab / temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆2,134 Updated last year
facebookresearch / d2go
D2Go is a toolkit for efficient deep learning
☆848 Updated 9 months ago
facebookresearch / fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
☆2,157 Updated 2 weeks ago
cvdfoundation / kinetics-dataset
☆874 Updated last year
open-mmlab / mmcv
OpenMMLab Computer Vision Foundation
☆6,201 Updated 3 months ago
libffcv / ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
☆2,952 Updated last year
sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆946 Updated 2 years ago
open-mmlab / mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT…
☆3,764 Updated last year
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,552 Updated last year
facebookresearch / AugLy
A data augmentations library for audio, image, text, and video.
☆5,030 Updated this week
dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,510 Updated 6 months ago
facebookresearch / detr
End-to-End Object Detection with Transformers
☆14,547 Updated last year
google-research / deeplab2
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel …
☆1,027 Updated 2 years ago
lucidrains / TimeSformer-pytorch
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆719 Updated 3 years ago
open-mmlab / mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
☆3,721 Updated 9 months ago
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,273 Updated 2 years ago
apple / ml-cvnets
CVNets: A library for training computer vision networks
☆1,905 Updated last year
facebookresearch / Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆1,961 Updated last year
google-research / big_transfer
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
☆1,529 Updated last year