Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
☆117Aug 23, 2025Updated 6 months ago
Alternatives and similar repositories for TeSTra
Users that are interested in TeSTra are comparing it to the libraries listed below
Sorting:
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆49Oct 7, 2023Updated 2 years ago
- [NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection☆141Jul 25, 2024Updated last year
- ☆18Jul 26, 2023Updated 2 years ago
- ☆40May 7, 2024Updated last year
- SimOn: A Simple Framework for Online Temporal Action Localization☆22Nov 12, 2022Updated 3 years ago
- Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".☆97Jul 16, 2023Updated 2 years ago
- ☆11Aug 7, 2024Updated last year
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆138Aug 23, 2025Updated 6 months ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆572Jan 30, 2026Updated last month
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆112Aug 3, 2023Updated 2 years ago
- [ICCV 2019] Official implementation of Temporal Recurrent Networks for Online Action Detection☆85Jul 21, 2022Updated 3 years ago
- Implementation of "Temporal Recurrent Networks for Online Action Detection"☆23May 6, 2019Updated 6 years ago
- ☆22Mar 7, 2025Updated last year
- Code for "Long-tail Detection with Effective Class-Margins." (ECCV 2022 Oral)☆63Sep 2, 2023Updated 2 years ago
- ☆87Mar 4, 2024Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation☆17Nov 13, 2022Updated 3 years ago
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Oct 3, 2024Updated last year
- ☆14Jul 14, 2025Updated 7 months ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆22Apr 10, 2025Updated 11 months ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆90Apr 14, 2023Updated 2 years ago
- Code for Few-View Object Reconstruction with Unknown Categories and Camera Poses at 3DV 2024 (oral)☆93Jan 23, 2024Updated 2 years ago
- [ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization☆201Dec 18, 2025Updated 2 months ago
- Code release for "Learning Video Representations from Large Language Models"☆536Oct 1, 2023Updated 2 years ago
- Code release for ActionFormer (ECCV 2022)☆544Apr 11, 2024Updated last year
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆29Sep 23, 2024Updated last year
- ☆193Oct 22, 2022Updated 3 years ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- Global Tracking Transformers, CVPR 2022☆379Aug 2, 2022Updated 3 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- [NeurIPS 2022 Spotlight] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator☆30Oct 3, 2022Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,689Dec 8, 2023Updated 2 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆50Jun 21, 2024Updated last year
- All about FineGym (CVPR 2020 Oral): models, features, data, and more... keep starring and stay tuned!☆153Dec 26, 2024Updated last year
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆182Oct 19, 2021Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago