facebookresearch/Motionformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/Motionformer)

facebookresearch / Motionformer

Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers

☆234

Alternatives and similar repositories for Motionformer

Users that are interested in Motionformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,864Apr 9, 2024Updated 2 years ago
TengdaHan / CoCLR
View on GitHub
[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
☆288Oct 10, 2021Updated 4 years ago
SwinTransformer / Video-Swin-Transformer
View on GitHub
This is an official implementation for "Video Swin Transformers".
☆1,667Mar 8, 2023Updated 3 years ago
dibschat / tempAgg
View on GitHub
[ECCV 2020] Temporal Aggregate Representations for Long-Range Video Understanding
☆11Sep 13, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆614Dec 6, 2023Updated 2 years ago
Chuhanxx / Temporal_Query_Networks
View on GitHub
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 4 years ago
facebookresearch / omnivore
View on GitHub
Omnivore: A Single Model for Many Visual Modalities
☆573Nov 12, 2022Updated 3 years ago
ju-chen / Efficient-Prompt
View on GitHub
☆197Oct 22, 2022Updated 3 years ago
ekazakos / MTCN
View on GitHub
Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch
☆20Dec 16, 2021Updated 4 years ago
microsoft / CtP
View on GitHub
☆45Apr 30, 2021Updated 5 years ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,394Mar 16, 2026Updated 4 months ago
TengdaHan / MemDPC
View on GitHub
[ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
☆167Apr 29, 2021Updated 5 years ago
facebookresearch / MeMViT
View on GitHub
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆155Nov 30, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
StanLei52 / TQVSR
View on GitHub
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆24Sep 11, 2023Updated 2 years ago
yzfly / TCM
View on GitHub
TCM: Temporal Correlation Module
☆17Apr 24, 2021Updated 5 years ago
Sense-X / UniFormer
View on GitHub
[ICLR2022] official implementation of UniFormer
☆907Mar 29, 2024Updated 2 years ago
hustvl / MIMDet
View on GitHub
[ICCV 2023] You Only Look at One Partial Sequence
☆343Oct 21, 2023Updated 2 years ago
facebookresearch / AVT
View on GitHub
Code release for ICCV 2021 paper "Anticipative Video Transformer"
☆154Feb 11, 2022Updated 4 years ago
facebookresearch / pytorchvideo
View on GitHub
A deep learning library for video understanding research.
☆3,565May 5, 2026Updated 2 months ago
decisionforce / TPN
View on GitHub
[CVPR 2020] Temporal Pyramid Network for Action Recognition
☆394Jan 12, 2021Updated 5 years ago
megvii-research / SOLQ
View on GitHub
"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.
☆200Apr 17, 2022Updated 4 years ago
wjf5203 / SeqFormer
View on GitHub
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
☆350Aug 2, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
yukimasano / single_img_pretraining
View on GitHub
Code for generating a single image pretraining dataset
☆13Aug 3, 2021Updated 4 years ago
TengdaHan / TemporalAlignNet
View on GitHub
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122Oct 9, 2023Updated 2 years ago
amazon-science / video-contrastive-learning
View on GitHub
Video Contrastive Learning with Global Context, ICCVW 2021
☆162May 30, 2022Updated 4 years ago
MCG-NJU / TDN
View on GitHub
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
☆386Sep 17, 2022Updated 3 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆303Mar 10, 2020Updated 6 years ago
Flowerfan / SF-Net
View on GitHub
☆74Jan 27, 2022Updated 4 years ago
NVlabs / MinVIS
View on GitHub
☆276Dec 4, 2024Updated last year
mengyuest / AdaFuse
View on GitHub
[ICLR2021] AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
☆35Apr 8, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
rxtan2 / video-grounding-narrations
View on GitHub
☆12Mar 12, 2023Updated 3 years ago
facebookresearch / long_seq_mae
View on GitHub
code release of research paper "Exploring Long-Sequence Masked Autoencoders"
☆100Oct 14, 2022Updated 3 years ago
rishikksh20 / ViViT-pytorch
View on GitHub
Implementation of ViViT: A Video Vision Transformer
☆559Jun 21, 2021Updated 5 years ago
fpv-iplab / rulstm
View on GitHub
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…
☆137Aug 23, 2023Updated 2 years ago
alibaba-mmai-research / TAdaConv
View on GitHub
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆246Aug 23, 2023Updated 2 years ago
TengdaHan / DPC
View on GitHub
Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.
☆256Oct 8, 2021Updated 4 years ago
liu-zhy / temporal-adaptive-module
View on GitHub
TAM: Temporal Adaptive Module for Video Recognition
☆207Aug 18, 2022Updated 3 years ago