PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆87Sep 13, 2021Updated 4 years ago
Alternatives and similar repositories for SlowFast
Users that are interested in SlowFast are comparing it to the libraries listed below
Sorting:
- Implementation of ViViT: A Video Vision Transformer☆556Jun 21, 2021Updated 4 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆305May 4, 2022Updated 3 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- A feishu bot daily push arxiv latest articles.☆10Nov 28, 2021Updated 4 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities?☆12Dec 22, 2020Updated 5 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 4 months ago
- A pytorch implementation of "Robust Facial Landmark Detection by Multi-order Multi-constrained Network"☆13Dec 9, 2020Updated 5 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆104Aug 12, 2020Updated 5 years ago
- ☆31Sep 20, 2021Updated 4 years ago
- This is an official implementation for "Video Swin Transformers".☆1,632Mar 8, 2023Updated 2 years ago
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition☆14Mar 14, 2023Updated 2 years ago
- ☆17Jun 17, 2021Updated 4 years ago
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆51Jul 9, 2022Updated 3 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆41Feb 28, 2024Updated 2 years ago
- PyTorch Implementation of TCSVT 2020 "Blind Omnidirectional Image Quality Assessment with Viewport Oriented Graph Convolutional Networks"☆24Mar 19, 2021Updated 4 years ago
- Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation☆17Nov 20, 2022Updated 3 years ago
- ☆69Apr 26, 2021Updated 4 years ago
- This is the official implementation of "Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation"☆41Dec 1, 2024Updated last year
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- AFNet(NeurIPS 2022)☆20Nov 24, 2022Updated 3 years ago
- SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)☆41Mar 10, 2022Updated 3 years ago
- Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning☆21Jun 20, 2024Updated last year
- [WACV '23] AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models☆27Jun 23, 2023Updated 2 years ago
- ☆21Apr 10, 2019Updated 6 years ago
- implement "2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning"☆21Mar 24, 2023Updated 2 years ago
- Implementation of the paper Video Action Transformer Network☆138Apr 5, 2021Updated 4 years ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,830Apr 9, 2024Updated last year
- A simple but efficient transformer model for video action recognition☆62Oct 8, 2022Updated 3 years ago
- Code for the UAV payload☆10Jun 16, 2017Updated 8 years ago
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆32Dec 7, 2022Updated 3 years ago
- ☆39Nov 22, 2024Updated last year
- ☆28Jul 9, 2021Updated 4 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- [CVPR'22] DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition☆27Sep 28, 2022Updated 3 years ago
- ☆193Oct 22, 2022Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,681Dec 8, 2023Updated 2 years ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆39Apr 20, 2025Updated 10 months ago
- My implementation for the paper Context-Aware Emotion Recognition Networks☆30Mar 12, 2022Updated 3 years ago
- [CVPR23] PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation☆39Jul 7, 2023Updated 2 years ago