Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos
☆30Sep 9, 2024Updated last year
Alternatives and similar repositories for ProTAS
Users that are interested in ProTAS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of Error Detection in Egocentric Procedural Task Videos☆22Sep 20, 2025Updated 6 months ago
- ☆15Feb 3, 2025Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆89Jan 23, 2026Updated 2 months ago
- ☆23Aug 19, 2024Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023)☆73Aug 16, 2023Updated 2 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆49Jun 21, 2024Updated last year
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆31Jun 9, 2025Updated 9 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"☆19Aug 23, 2024Updated last year
- Replace the MS-TCN with ASFormer in asrf☆22Oct 28, 2021Updated 4 years ago
- A curated list of awesome temporal action segmentation resources.☆246Apr 4, 2024Updated last year
- Python scripts to download Assembly101 from Google Drive☆67Oct 10, 2024Updated last year
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆21Jan 9, 2025Updated last year
- Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation☆11Jul 24, 2023Updated 2 years ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated last year
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆102Oct 30, 2022Updated 3 years ago
- ☆40May 7, 2024Updated last year
- Code for ''Alleviating Over-segmentation Errors by Detecting Action Boundaries'' accepted in WACV2021☆63Apr 26, 2023Updated 2 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 3 years ago
- 🔥🔥🔥 Object State Description & Change Detection☆10Mar 30, 2024Updated last year
- ☆148Mar 4, 2019Updated 7 years ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆40Jan 27, 2026Updated last month
- ☆15May 23, 2023Updated 2 years ago
- [ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos☆20Aug 21, 2025Updated 7 months ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- ☆24Mar 24, 2023Updated 3 years ago
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆25Mar 20, 2024Updated 2 years ago
- Understand what physics/algorithms do transformers learn internally when trained on planetary motion☆39Feb 9, 2026Updated last month
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆28Nov 25, 2024Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆50Oct 7, 2023Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- [CVPR 2024] Solving Masked Jigsaw Puzzles with Diffusion Vision Transformers☆28Mar 8, 2025Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- ☆48Sep 22, 2023Updated 2 years ago
- Online Product Reviews for Affordances☆24Dec 12, 2018Updated 7 years ago
- ☆18Jul 26, 2023Updated 2 years ago
- ☆20Jul 28, 2025Updated 7 months ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆29Sep 23, 2024Updated last year