innat / VideoMAE
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆16Updated last year
Alternatives and similar repositories for VideoMAE:
Users that are interested in VideoMAE are comparing it to the libraries listed below
- Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling☆29Updated last month
- ☆62Updated last week
- Self-Supervised Learning in PyTorch☆133Updated 10 months ago
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res…☆36Updated last year
- Light weight toolkit for bounding boxes providing conversion between bounding box types and simple computations.☆149Updated 3 months ago
- A clean, modular implementation of the Yolov7 model family, which uses the official pretrained weights, with utilities for training the m…☆115Updated 11 months ago
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.☆30Updated 2 years ago
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!☆147Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆56Updated last week
- Vision Transformers for image classification, image segmentation, and object detection.☆43Updated 3 months ago
- Torch nn vizualization☆51Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆54Updated 11 months ago
- This folder of code contains code and notebooks to supplement the "Vision Transformers Explained" series published on Towards Data Scienc…☆71Updated 8 months ago
- TensorFlow port of PyTorch Image Models (timm) - image models with pretrained weights.☆287Updated 3 months ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆151Updated 7 months ago
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆107Updated 2 years ago
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.☆100Updated 2 years ago
- Minimal PyTorch implementation of YOLOv5 and StrongSort☆65Updated 2 years ago
- Run zero-shot prediction models on your data☆30Updated last month
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆163Updated last year
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆160Updated 3 years ago
- Contains the "pycocotools" package on PyPI. Changes made to the official cocoapi about packaging.☆134Updated 5 months ago
- A really more real-time adaptation of deep sort☆175Updated 5 months ago
- FiftyOne Plugin for finding common image quality issues☆31Updated 3 months ago
- Probing the representations of Vision Transformers.☆319Updated 2 years ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆426Updated last year
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2☆25Updated last year
- ☆198Updated last year
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆454Updated last year
- An SDK for Transformers + YOLO and other SSD family models☆55Updated last month