innat / VideoMAE
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆20 Updated last year
Alternatives and similar repositories for VideoMAE:
Users who are interested in VideoMAE are comparing it to the libraries listed below.
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling ☆34 Updated 4 months ago
- Easiest way of fine-tuning HuggingFace video classification models ☆141 Updated 2 years ago
- A really more real-time adaptation of deep sort ☆198 Updated 7 months ago
- Self-Supervised Learning in PyTorch ☆136 Updated last year
- An SDK for Transformers + YOLO and other SSD family models ☆61 Updated 2 months ago
- The second generation of YOWO action detector. ☆246 Updated 11 months ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer. ☆971 Updated last year
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Pose models ☆136 Updated 9 months ago
- Timm model explorer ☆39 Updated last year
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking ☆619 Updated 6 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆68 Updated this week
- A Modular End-to-End Tracking Framework for Research and Development 🎯🔬 ☆133 Updated this week
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models ☆65 Updated last year
- Vision Transformer Cookbook with Tensorflow ☆333 Updated 3 years ago
- ☆186 Updated 2 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆97 Updated 11 months ago
- Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023) ☆270 Updated last year
- Efficient violence detection in surveillance videos using Human Skeletons and Motion Estimation ☆49 Updated last year
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson ☆190 Updated last year
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks. ☆100 Updated 3 years ago
- ☆36 Updated 2 years ago
- This repository contains the MPOSE2021 Dataset for short-time pose-based Human Action Recognition (HAR). ☆54 Updated last year
- ☆60 Updated 3 years ago
- ☆94 Updated 7 months ago
- Continuation of an abandoned project fast-coco-eval ☆100 Updated 2 months ago
- Code Release for MViTv2 on Image Recognition. ☆422 Updated 4 months ago
- SFSORT: Scene Features-based Simple Online Real-Time Tracker ☆54 Updated 3 months ago
- Vision Transformers for image classification, image segmentation, and object detection. ☆49 Updated 6 months ago
- Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,fl… ☆614 Updated 3 months ago
- Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information ☆56 Updated last year