SamsungLabs / StepFormer

☆16

Alternatives and similar repositories for StepFormer

Users who are interested in StepFormer are comparing it to the repositories listed below.

Chuhanxx / helping_hand_for_egocentric_videos
Implementation of the paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'
☆33 Updated last year
facebookresearch / VidOSC
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
☆33 Updated 8 months ago
Sid2697 / EgoProceL-egocentric-procedure-learning
Code implementation for our ECCV 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"
☆28 Updated last year
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆98 Updated 11 months ago
TengdaHan / TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆118 Updated last year
zihuixue / AlignEgoExo
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆17 Updated last year
facebookresearch / htstep
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆18 Updated last year
fpv-iplab / EASG
Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)
☆39 Updated last month
Chuhanxx / Temporal_Query_Networks
The implementation of the CVPR 2021 paper "Temporal Query Networks for Fine-grained Video Understanding"
☆62 Updated 3 years ago
houzhijian / GroundNLQ
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
☆17 Updated last year
salesforce / paprika
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆49 Updated 4 months ago
robert80203 / EgoPER_official
The official implementation of Error Detection in Egocentric Procedural Task Videos
☆16 Updated 9 months ago
fpv-iplab / stillfast
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆11 Updated 2 years ago
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆42 Updated last year
srama2512 / NaQ
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.
☆15 Updated last year
Echo0125 / MAT-Memory-and-Anticipation-Transformer
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
☆45 Updated last year
zhaoyue-zephyrus / AVION
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
☆129 Updated 10 months ago
ttlmh / Bridge-Prompt
[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
☆98 Updated 2 years ago
antoyang / TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆180 Updated last year
NNNNAI / Ego4d_NLQ_2022_1st_Place_Solution
The 1st place solution of the 2022 Ego4D Natural Language Queries challenge.
☆32 Updated 2 years ago
EGO4D / episodic-memory
☆120 Updated last year
Nmegha2601 / anticipatr
☆11 Updated 2 years ago
MCG-NJU / TRACE
[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation
☆58 Updated 2 years ago
facebookresearch / Ego-Exo
Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)
☆33 Updated 3 years ago
OpenGVLab / EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆59 Updated 9 months ago
facebookresearch / video-distant-supervision
This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…
☆42 Updated 2 years ago
lbaermann / qaego4d
Code and Dataset for the CVPRW Paper "Where did I leave my keys? - Episodic-Memory-Based Question Answering on Egocentric Videos"
☆25 Updated last year
OpenGVLab / EgoVideo
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Challenges in CVPR 2024
☆127 Updated 3 weeks ago
tsujuifu / pytorch_empirical-mvm
A PyTorch implementation of EmpiricalMVM
☆41 Updated last year
gongda0e / FUTR
Future Transformer for Long-term Action Anticipation (CVPR 2022)
☆49 Updated 2 years ago