Code release for ICCV 2021 paper "Anticipative Video Transformer"
☆154Feb 11, 2022Updated 4 years ago
Alternatives and similar repositories for AVT
Users that are interested in AVT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆78Jan 5, 2024Updated 2 years ago
- [ECCV 2020] Temporal Aggregate Representations for Long-Range Video Understanding☆11Sep 13, 2021Updated 4 years ago
- Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…☆134Aug 23, 2023Updated 2 years ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆48Dec 22, 2022Updated 3 years ago
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- ☆19Sep 10, 2021Updated 4 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆32Aug 15, 2023Updated 2 years ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆168Aug 1, 2022Updated 3 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 2 years ago
- ☆12Apr 6, 2023Updated 2 years ago
- ☆78Aug 16, 2021Updated 4 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆29Sep 23, 2024Updated last year
- ☆25Nov 22, 2019Updated 6 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- ☆35Mar 22, 2022Updated 4 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆50Oct 7, 2023Updated 2 years ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f…☆24Jan 22, 2023Updated 3 years ago
- Omnivore: A Single Model for Many Visual Modalities☆572Nov 12, 2022Updated 3 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 4 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆233Jun 13, 2022Updated 3 years ago
- Download scripts for EPIC-KITCHENS☆163Jul 8, 2025Updated 8 months ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset☆549Mar 14, 2026Updated last week
- Official PyTorch implementation of Higher Order Recurrent Space-Time Transformer and Higher-Order Recurrent Network with Space-Time Atten…☆17Feb 16, 2026Updated last month
- Code release for "Learning Video Representations from Large Language Models"☆534Oct 1, 2023Updated 2 years ago
- [MedIA'22] Anticipation for surgical workflow through instrument interaction and recognized signals☆17Feb 11, 2022Updated 4 years ago
- [ECCV 2022] Tackling Long-Tailed Category Distribution Under Domain Shifts☆25Nov 29, 2022Updated 3 years ago
- ☆10Jul 14, 2023Updated 2 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- This is the pytorch version of tcc loss, used in paper 'Temporal Cycle-Consistency Learning'.☆27Oct 11, 2020Updated 5 years ago
- ☆193Oct 22, 2022Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer☆557Jun 21, 2021Updated 4 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆153Nov 30, 2022Updated 3 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆117Sep 15, 2022Updated 3 years ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆117Aug 23, 2025Updated 7 months ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆114Aug 3, 2023Updated 2 years ago
- Code for the HowTo100M paper☆298Mar 10, 2020Updated 6 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆35Jun 8, 2021Updated 4 years ago