Code release for ICCV 2021 paper "Anticipative Video Transformer"
☆154 Feb 11, 2022 Updated 4 years ago
Alternatives and similar repositories for AVT
Users that are interested in AVT are comparing it to the libraries listed below
- ☆78 Jan 5, 2024 Updated 2 years ago
- [ECCV 2020] Temporal Aggregate Representations for Long-Range Video Understanding ☆11 Sep 13, 2021 Updated 4 years ago
- Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un… ☆134 Aug 23, 2023 Updated 2 years ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022) ☆48 Dec 22, 2022 Updated 3 years ago
- ☆12 Apr 6, 2023 Updated 2 years ago
- ☆19 Sep 10, 2021 Updated 4 years ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset ☆165 Aug 1, 2022 Updated 3 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation. ☆32 Aug 15, 2023 Updated 2 years ago
- Official code implementation of the paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? ☆29 Sep 23, 2024 Updated last year
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat… ☆13 Apr 11, 2023 Updated 2 years ago
- ☆34 Mar 22, 2022 Updated 3 years ago
- ☆78 Aug 16, 2021 Updated 4 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆49 Oct 7, 2023 Updated 2 years ago
- Code release for "Learning Video Representations from Large Language Models" ☆536 Oct 1, 2023 Updated 2 years ago
- ☆25 Nov 22, 2019 Updated 6 years ago
- Omnivore: A Single Model for Many Visual Modalities ☆571 Nov 12, 2022 Updated 3 years ago
- PyTorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners ☆116 Sep 15, 2022 Updated 3 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020) ☆31 Aug 3, 2022 Updated 3 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers ☆233 Jun 13, 2022 Updated 3 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022 ☆11 Mar 23, 2022 Updated 3 years ago
- Official implementation of our CVPR'22 paper. ☆13 Nov 18, 2022 Updated 3 years ago
- [CVPR 2024] Official repository of ST_GT ☆10 Sep 15, 2024 Updated last year
- Download scripts for EPIC-KITCHENS ☆162 Jul 8, 2025 Updated 7 months ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f… ☆24 Jan 22, 2023 Updated 3 years ago
- [ECCV 2022] Tackling Long-Tailed Category Distribution Under Domain Shifts ☆25 Nov 29, 2022 Updated 3 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM ☆33 Mar 15, 2022 Updated 3 years ago
- ☆193 Oct 22, 2022 Updated 3 years ago
- [ECCV 2022] Official PyTorch implementation of the paper: "Zero-Shot Temporal Action Detection via Vision-Language Prompting" ☆112 Aug 3, 2023 Updated 2 years ago
- SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral) ☆351 Aug 2, 2022 Updated 3 years ago
- ☆10 Jul 14, 2023 Updated 2 years ago
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset ☆537 Feb 19, 2026 Updated last week
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 Nov 29, 2023 Updated 2 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco… ☆182 Oct 19, 2021 Updated 4 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf) ☆39 Mar 30, 2021 Updated 4 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day" ☆138 Aug 23, 2025 Updated 6 months ago
- ☆13 Jul 20, 2024 Updated last year
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao… ☆15 Dec 30, 2021 Updated 4 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time ☆46 Jun 11, 2024 Updated last year
- Code for the HowTo100M paper ☆293 Mar 10, 2020 Updated 5 years ago