Dotori-HJ / TE-TAD
[CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression"
☆17Updated 6 months ago
Alternatives and similar repositories for TE-TAD:
Users that are interested in TE-TAD are comparing it to the libraries listed below
- ☆23Updated 3 months ago
- [CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames☆32Updated 6 months ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆27Updated 5 months ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆43Updated 2 weeks ago
- Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"☆16Updated last year
- ☆47Updated last year
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆39Updated 2 months ago
- Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval"☆72Updated last month
- ☆30Updated 2 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆49Updated last year
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆61Updated 11 months ago
- End to End Streaming Video Temporal Segmentation☆23Updated 7 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆115Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023)☆58Updated last year
- A curated list of awesome self-supervised learning methods in videos☆123Updated last week
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆14Updated 2 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆48Updated 7 months ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆121Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆44Updated 3 months ago
- ☆26Updated 4 months ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆262Updated 9 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆27Updated last year
- A curated list of awesome temporal action segmentation resources.☆173Updated 9 months ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆49Updated 4 months ago
- This is an official implementation for "Block Selection Method for Using Feature Norm in Out-of-distribution Detection", CVPR 2023.☆22Updated 8 months ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆49Updated 2 years ago
- ☆23Updated 3 months ago
- ☆14Updated last year
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆33Updated 9 months ago
- ☆23Updated last year