Richard-61 / FineAction
The official codebase of the FineAction dataset. The data and code for FineAction will be updated.
☆16Updated 2 years ago
Related projects
Alternatives and complementary repositories for FineAction
- ☆50Updated last year
- SimOn: A Simple Framework for Online Temporal Action Localization☆18Updated 2 years ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆47Updated 2 months ago
- ☆17Updated 7 months ago
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Updated last month
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆21Updated 4 months ago
- BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection☆49Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆27Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆98Updated 9 months ago
- ☆54Updated 4 months ago
- ☆31Updated 6 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- ☆15Updated last year
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆42Updated 4 months ago
- The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023☆16Updated 9 months ago
- Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.☆51Updated 2 months ago
- Temporal Action Localization Visualization Tool (TALVT) is a simple JavaScript-based tool for visualizing the outcomes of the …☆27Updated 4 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆22Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning"☆27Updated 9 months ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆15Updated 9 months ago
- [ICCV 2023] Accurate and Fast Compressed Video Captioning☆34Updated 9 months ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆31Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆40Updated 11 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆90Updated 4 months ago
- Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆39Updated last year
- Official PyTorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆48Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆45Updated last year
- ☆41Updated 4 months ago
- [ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"☆17Updated last month
- (CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…☆27Updated 7 months ago