facebookresearch/MeMViT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/MeMViT)

facebookresearch / MeMViT

Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022

☆155

Alternatives and similar repositories for MeMViT

Users that are interested in MeMViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zeyun-zhong / AFFT
View on GitHub
Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.
☆32Aug 15, 2023Updated 2 years ago
facebookresearch / long_seq_mae
View on GitHub
code release of research paper "Exploring Long-Sequence Masked Autoencoders"
☆100Oct 14, 2022Updated 3 years ago
zhaoyue-zephyrus / AVION
View on GitHub
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
☆138Aug 23, 2025Updated 11 months ago
chaoyuaw / lvu
View on GitHub
☆87Mar 4, 2024Updated 2 years ago
dibschat / tempAgg
View on GitHub
[ECCV 2020] Temporal Aggregate Representations for Long-Range Video Understanding
☆11Sep 13, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / Motionformer
View on GitHub
Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers
☆234Jun 13, 2022Updated 4 years ago
facebookresearch / LaViLa
View on GitHub
Code release for "Learning Video Representations from Large Language Models"
☆534Oct 1, 2023Updated 2 years ago
facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,864Apr 9, 2024Updated 2 years ago
MartinXM / TPS
View on GitHub
A simple but efficient transformer model for video action recognition
☆64Oct 8, 2022Updated 3 years ago
MCG-NJU / VideoMAE-Action-Detection
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
☆70Feb 3, 2023Updated 3 years ago
facebookresearch / mvit
View on GitHub
Code Release for MViTv2 on Image Recognition.
☆456Nov 26, 2024Updated last year
4paradigm-CV / SE-STAD
View on GitHub
☆10Jan 3, 2023Updated 3 years ago
facebookresearch / ToMe
View on GitHub
A method to increase the speed and lower the memory footprint of existing vision transformers.
☆1,208Jun 17, 2024Updated 2 years ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,393Mar 16, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
facebookresearch / AVT
View on GitHub
Code release for ICCV 2021 paper "Anticipative Video Transformer"
☆154Feb 11, 2022Updated 4 years ago
seominseok0429 / inception-I3D-NON-LOCAL
View on GitHub
Inception-I3D, Non Local finetune, hmdb51_flow
☆15Oct 15, 2019Updated 6 years ago
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
StanfordVL / Sonicverse
View on GitHub
☆22Mar 18, 2023Updated 3 years ago
facebookresearch / dropout
View on GitHub
Code release for "Dropout Reduces Underfitting"
☆316May 6, 2023Updated 3 years ago
facebookresearch / omnivore
View on GitHub
Omnivore: A Single Model for Many Visual Modalities
☆573Nov 12, 2022Updated 3 years ago
lucidrains / memory-editable-transformer
View on GitHub
My explorations into editing the knowledge and memories of an attention network
☆35Dec 8, 2022Updated 3 years ago
cg1177 / VideoLLM
View on GitHub
VideoLLM: Modeling Video Sequence with Large Language Models
☆158Aug 18, 2023Updated 2 years ago
vlfom / RNCDL
View on GitHub
[NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".
☆110Jun 13, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
leonnnop / Locater
View on GitHub
[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation
☆47Jan 20, 2024Updated 2 years ago
daniel-code / TubeViT
View on GitHub
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95Jul 15, 2026Updated last week
NVlabs / FAN
View on GitHub
Official PyTorch implementation of Fully Attentional Networks
☆484Mar 31, 2023Updated 3 years ago
liuzhuang13 / anytime
View on GitHub
Anytime Dense Prediction with Confidence Adaptivity (ICLR 2022)
☆51Aug 23, 2024Updated last year
SwinTransformer / Video-Swin-Transformer
View on GitHub
This is an official implementation for "Video Swin Transformers".
☆1,667Mar 8, 2023Updated 3 years ago
fpv-iplab / rulstm
View on GitHub
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…
☆137Aug 23, 2023Updated 2 years ago
SamsungLabs / video-retrieval-sampler
View on GitHub
The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'.
☆11Aug 23, 2022Updated 3 years ago
amazon-science / long-short-term-transformer
View on GitHub
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
☆140Jul 25, 2024Updated last year
zhaoyue-zephyrus / TeSTra
View on GitHub
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
☆119Aug 23, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
md-mohaiminul / ViS4mer
View on GitHub
☆58Dec 2, 2025Updated 7 months ago
janghyuncho / ECM-Loss
View on GitHub
Code for "Long-tail Detection with Effective Class-Margins." (ECCV 2022 Oral)
☆62Sep 2, 2023Updated 2 years ago
motokimura / yolox-ti-lite_tflite
View on GitHub
YOLOX-ti-lite models exportable to TFLite
☆23Mar 27, 2023Updated 3 years ago
JialianW / GRiT
View on GitHub
GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)
☆341Jan 8, 2024Updated 2 years ago
facebookresearch / Listen-to-Look
View on GitHub
Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)
☆130Aug 31, 2021Updated 4 years ago
weiguoPian / AV-CIL_ICCV2023
View on GitHub
[ICCV 2023] Audio-Visual Class-Incremental Learning
☆35Sep 29, 2024Updated last year
rwightman / efficientnet-jax
View on GitHub
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
☆130Jan 4, 2024Updated 2 years ago