innat / VideoMAELinks
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆21Updated last year
Alternatives and similar repositories for VideoMAE
Users that are interested in VideoMAE are comparing it to the libraries listed below
Sorting:
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling☆33Updated 5 months ago
- Self-Supervised Learning in PyTorch☆138Updated last year
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆473Updated 2 years ago
- A clean, modular implementation of the Yolov7 model family, which uses the official pretrained weights, with utilities for training the m…☆115Updated last year
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆990Updated last year
- ☆70Updated 2 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆70Updated this week
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆187Updated 11 months ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆435Updated last year
- Repository containing community-contributed Ultralytics model configs.☆16Updated last month
- Vision Transformers for image classification, image segmentation, and object detection.☆51Updated 7 months ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆340Updated 6 months ago
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res…☆38Updated last year
- Draw bounding boxes on raw images based on YOLO format annotation. Help to check the correctness of annotation and extract the images wit…☆97Updated 10 months ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆642Updated 8 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆96Updated 9 months ago
- A really more real-time adaptation of deep sort☆207Updated 9 months ago
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!☆147Updated 2 years ago
- A personal implementation of YOLOv5 (v6.0)☆52Updated last year
- Pytorch to Keras/Tensorflow/TFLite conversion made intuitive☆311Updated 2 months ago
- A Modular End-to-End Tracking Framework for Research and Development 🎯🔬☆153Updated last week
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆176Updated 3 years ago
- Code Release for MViTv2 on Image Recognition.☆427Updated 6 months ago
- End-to-End Object Detection with Transformers☆51Updated last month
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆154Updated last year
- Surveillance Perspective Human Action Recognition Dataset: 7759 Videos from 14 Action Classes, aggregated from multiple sources, all crop…☆102Updated 2 months ago
- ☆203Updated last year
- Timm model explorer☆39Updated last year
- PyTorch Faster R-CNN Object Detection on Custom Dataset☆247Updated this week
- A curated list of plugins that you can add to your FiftyOne install!☆123Updated last week