facebookresearch / pytorchvideo
A deep learning library for video understanding research.
☆3,389Updated 3 weeks ago
Alternatives and similar repositories for pytorchvideo:
Users that are interested in pytorchvideo are comparing it to the libraries listed below
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆6,794Updated 2 months ago
- Code release for ConvNeXt model☆5,872Updated 2 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,266Updated 11 months ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆6,605Updated 7 months ago
- Official DeiT repository☆4,137Updated 11 months ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,077Updated 2 months ago
- An end-to-end PyTorch framework for image and video classification☆1,599Updated 7 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,302Updated 6 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,430Updated 3 weeks ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,450Updated 6 months ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,625Updated 10 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,227Updated last year
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,092Updated 7 months ago
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆2,020Updated 7 months ago
- Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"☆2,677Updated 6 months ago
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,444Updated last month
- This is an official implementation for "Video Swin Transformers".☆1,493Updated last year
- OpenMMLab Computer Vision Foundation☆6,014Updated this week
- 🐍 Geometric Computer Vision Library for Spatial AI☆10,235Updated last week
- OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT…☆3,642Updated last year
- A python library for self-supervised learning on images.☆3,284Updated this week
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,895Updated 8 months ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,401Updated 9 months ago
- An open-source toolbox for action understanding based on PyTorch☆1,867Updated 2 years ago
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,088Updated 2 months ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆8,542Updated 6 months ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,890Updated 2 months ago
- A PyTorch implementation of EfficientNet☆8,023Updated 2 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,534Updated 4 years ago
- A data augmentations library for audio, image, text, and video.☆4,988Updated 2 weeks ago