Dotori-HJ / DiGITView external linksLinks
[CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer"
☆26Jul 1, 2025Updated 7 months ago
Alternatives and similar repositories for DiGIT
Users that are interested in DiGIT are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆27Jun 26, 2024Updated last year
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆317Apr 29, 2025Updated 9 months ago
- ☆16May 7, 2025Updated 9 months ago
- This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …☆22Aug 18, 2025Updated 5 months ago
- Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'☆35Jan 23, 2026Updated 3 weeks ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆205Dec 27, 2023Updated 2 years ago
- Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability☆28Mar 25, 2024Updated last year
- ☆29Jul 4, 2024Updated last year
- [AAAI-25]Code for SEAL☆15Sep 25, 2025Updated 4 months ago
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"☆37Mar 30, 2023Updated 2 years ago
- The code and data of paper “CausalMTA: Eliminating the User Confounding Bias for Causal Multi-touch Attribution” KDD 2023☆10May 10, 2023Updated 2 years ago
- [ICCV 2025] "Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning".☆15Dec 11, 2025Updated 2 months ago
- [TCSVT 2024] Official implementation of the paper: Benchmarking Micro-action Recognition: Dataset, Methods, and Applications☆48Aug 11, 2025Updated 6 months ago
- ☆10May 22, 2022Updated 3 years ago
- [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"☆42Mar 18, 2024Updated last year
- ☆10Oct 13, 2024Updated last year
- ☆18Jul 17, 2025Updated 7 months ago
- A curated publication list on weakly-supervised temporal action localization☆156Nov 27, 2023Updated 2 years ago
- [ACM MM 2025] This repository is the official implementation of the paper "Motion Matters: Motion-guided Modulation Network for Skeleton-…☆20Nov 28, 2025Updated 2 months ago
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- B 站经验助手☆17Jun 12, 2025Updated 8 months ago
- ☆11Aug 9, 2022Updated 3 years ago
- ☆48Sep 22, 2023Updated 2 years ago
- The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)☆42Jun 1, 2023Updated 2 years ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- ☆15Feb 3, 2025Updated last year
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing☆14Nov 17, 2024Updated last year
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆11Feb 10, 2020Updated 6 years ago
- Notionに毎日新しいarXiv論文のアブストラクト日本語訳 + αを表示するスクリプト☆13Jan 22, 2023Updated 3 years ago
- An open-source implementaion for fine-tuning DINOv2 by Meta.☆13Jul 21, 2025Updated 6 months ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆54Sep 4, 2023Updated 2 years ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆55Jul 5, 2024Updated last year
- This is the official code repository of our dataset and ECCV 2024 paper entitled "Oulu Remote-photoplethysmography Physical Domain Attac…☆13Jul 9, 2025Updated 7 months ago
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆14Mar 1, 2025Updated 11 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆14Jul 26, 2024Updated last year
- Pytorch implementation of SphereGAN(Sphere Generative Adversarial Network Based on Geometric Moment Matching)☆15Jul 2, 2019Updated 6 years ago
- Med-DANet Series (ECCV 2022 & WACV 2024)☆13Jan 2, 2024Updated 2 years ago