innat / VideoMAE
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆15Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for VideoMAE
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res…☆34Updated last year
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling☆26Updated 7 months ago
- Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,fl…☆597Updated last month
- Self-Supervised Learning in PyTorch☆127Updated 7 months ago
- TensorFlow port of PyTorch Image Models (timm) - image models with pretrained weights.☆287Updated last month
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2☆25Updated last year
- Vision Transformer Cookbook with Tensorflow☆307Updated 2 years ago
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.☆100Updated 2 years ago
- Heatmap Learner Convolutional Neural Network for Object Counting and Localization☆42Updated 9 months ago
- Light weight toolkit for bounding boxes providing conversion between bounding box types and simple computations.☆148Updated last month
- Vision Transformers for image classification, image segmentation, and object detection.☆43Updated 3 weeks ago
- A library that includes Keras3 layers, blocks and models with pretrained weights, providing support for transfer learning, feature extrac…☆40Updated 3 weeks ago
- Pytorch to Keras/Tensorflow/TFLite conversion made intuitive☆267Updated 2 months ago
- A clean, modular implementation of the Yolov7 model family, which uses the official pretrained weights, with utilities for training the m…☆116Updated 9 months ago
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆106Updated 2 years ago
- Keras implementation of ViT (Vision Transformer)☆338Updated 5 months ago
- A multi-backend (TensorFlow, PyTorch, JAX, and NumPy) implementation of the Segment Anything model in Keras 3.0☆30Updated 7 months ago
- A Detection Toolbox for Tensorflow2☆56Updated last year
- YOLOv7 Object Blurring Using PyTorch and OpenCV☆64Updated this week
- A personal implementation of YOLOv5 (v6.0)☆47Updated last year
- Tensorflow implementation of Swin Transformer model.☆204Updated 2 years ago
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.☆30Updated 2 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆133Updated last year
- A Keras implementation of hybrid efficientnet swin transformer model.☆33Updated last year
- EscVM YouTube Channel Repository. Start from Notebooks ⬅️☆63Updated 2 months ago
- self defined efficientnetV2 according to official version. Including converted ImageNet/21K/21k-ft1k weights.☆77Updated 2 years ago
- In this tutorial, you will perform inference across 10 well-known pre-trained semantic segmentors and fine-tune on a custom dataset. Desi…☆74Updated 2 years ago
- ☆31Updated 2 years ago
- 2nd Place Google - Isolated Sign Language Recognition☆47Updated last year
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆445Updated last year