Dotori-HJ / TE-TAD
[CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression"
☆19Updated 9 months ago
Alternatives and similar repositories for TE-TAD:
Users that are interested in TE-TAD are comparing it to the libraries listed below
- [CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames☆33Updated 8 months ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆59Updated 2 months ago
- End to End Streaming Video Temporal Segmentation☆25Updated 3 weeks ago
- ☆50Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆52Updated 6 months ago
- Official Implementation of SnAG (CVPR 2024)☆44Updated 5 months ago
- ☆28Updated 6 months ago
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆18Updated 9 months ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆39Updated 4 months ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆60Updated last year
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆63Updated last year
- ☆23Updated last year
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆31Updated 7 months ago
- ☆38Updated 11 months ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆127Updated 7 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆123Updated last year
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆29Updated last month
- The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)☆40Updated last year
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆50Updated last year
- ☆44Updated last year
- A curated publication list on weakly-supervised temporal action localization☆147Updated last year
- ☆35Updated 5 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆96Updated 11 months ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆83Updated 3 weeks ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆41Updated 9 months ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization☆57Updated 2 years ago
- ☆27Updated 5 months ago
- A curated list of awesome temporal action segmentation resources.☆188Updated 11 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆88Updated 6 months ago
- ☆21Updated last year