ZijiaLewisLu / CVPR2024-FACTView external linksLinks
Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentation"
☆84Jan 23, 2026Updated 3 weeks ago
Alternatives and similar repositories for CVPR2024-FACT
Users that are interested in CVPR2024-FACT are comparing it to the libraries listed below
Sorting:
- Code for Diffusion Action Segmentation (ICCV 2023)☆73Aug 16, 2023Updated 2 years ago
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos☆28Sep 9, 2024Updated last year
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆50Jun 21, 2024Updated last year
- Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation☆134Feb 19, 2022Updated 3 years ago
- The official implementation of Error Detection in Egocentric Procedural Task Videos☆21Sep 20, 2025Updated 4 months ago
- ☆10Jan 26, 2025Updated last year
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆101Oct 30, 2022Updated 3 years ago
- ☆30Jan 29, 2020Updated 6 years ago
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"☆34Feb 5, 2024Updated 2 years ago
- Replace the MS-TCN with ASFormer in asrf☆22Oct 28, 2021Updated 4 years ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- Code for ''Alleviating Over-segmentation Errors by Detecting Action Boundaries'' accepted in WACV2021☆62Apr 26, 2023Updated 2 years ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆69Sep 11, 2024Updated last year
- ☆24Mar 24, 2023Updated 2 years ago
- [ECCV 2024] EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation☆18Apr 23, 2025Updated 9 months ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- ☆252Nov 13, 2023Updated 2 years ago
- Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty Estimation (ICML 2024)☆15Jul 19, 2024Updated last year
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- ☆13Jul 22, 2025Updated 6 months ago
- Code for I3D Feature Extraction☆160Aug 7, 2019Updated 6 years ago
- ☆15Oct 11, 2021Updated 4 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Inception-I3D, Non Local finetune, hmdb51_flow☆15Oct 15, 2019Updated 6 years ago
- ☆17Sep 19, 2025Updated 4 months ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆19Mar 13, 2024Updated last year
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆70Jan 4, 2026Updated last month
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…☆19Apr 5, 2024Updated last year
- ☆18Mar 1, 2024Updated last year
- ☆15Apr 3, 2023Updated 2 years ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 9 months ago
- This is an official repository of paper "Refining Action Segmentation with Hierarchical Video Representations", which is accepted as a re…☆17Oct 11, 2021Updated 4 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- [CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)☆156Feb 20, 2021Updated 4 years ago
- ☆35May 24, 2019Updated 6 years ago