innat / VideoMAE
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆21Updated last year
Alternatives and similar repositories for VideoMAE:
Users that are interested in VideoMAE are comparing it to the libraries listed below
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling☆33Updated 4 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆70Updated this week
- Vision Transformers for image classification, image segmentation, and object detection.☆50Updated 6 months ago
- Easiest way of fine-tuning HuggingFace video classification models☆141Updated 2 years ago
- XAI for yoloV8☆35Updated 2 months ago
- A clean, modular implementation of the Yolov7 model family, which uses the official pretrained weights, with utilities for training the m…☆114Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆182Updated 11 months ago
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆146Updated 2 years ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆849Updated last month
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆432Updated last year
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res…☆38Updated last year
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆979Updated last year
- ☆69Updated last month
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆192Updated last year
- An SDK for Transformers + YOLO and other SSD family models☆61Updated 3 months ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models☆117Updated this week
- A modular PyTorch library for vision transformer models☆162Updated last year
- A personal implementation of YOLOv5 (v6.0)☆51Updated last year
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆629Updated 7 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆56Updated last year
- Object tracking implemented with the Roboflow Inference API, DeepSort, and OpenAI CLIP.☆373Updated last year
- Heatmap Learner Convolutional Neural Network for Object Counting and Localization☆43Updated last year
- A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COC…☆197Updated 2 weeks ago
- The second generation of YOWO action detector.☆246Updated last year
- Repository containing community-contributed Ultralytics model configs.☆14Updated 2 weeks ago
- xLSTM as Generic Vision Backbone☆475Updated 6 months ago
- This repository contains the code for extracting bounding box coordinates from a binary segmentation mask.☆32Updated 3 years ago
- Keras implementation of ViT (Vision Transformer)☆348Updated 11 months ago
- ☆189Updated 2 months ago
- Code Release for MViTv2 on Image Recognition.☆424Updated 5 months ago