Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
☆24Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for MSQNet
Users that are interested in MSQNet are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆89Feb 25, 2025Updated last year
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆112Aug 3, 2023Updated 2 years ago
- Implementation of the dataset defined in Spiking Neural Networks for event-based action recognition: A new task to understand their adva…☆15Aug 9, 2023Updated 2 years ago
- Simple vispy-based frame-by-frame behavior annotation GUI. Also can display DeepLabCut poses within videos during annotation.☆15Sep 17, 2025Updated 5 months ago
- 🏆 The 1st Place Solution for AICity2022 Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆12Jun 28, 2022Updated 3 years ago
- [AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"☆21Sep 30, 2025Updated 5 months ago
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆19Mar 13, 2024Updated last year
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- ☆42Apr 7, 2024Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Oct 19, 2022Updated 3 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆103Nov 6, 2024Updated last year
- This is the offical repository of LLAVIDAL☆23Oct 4, 2025Updated 4 months ago
- ☆21May 11, 2025Updated 9 months ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆154Sep 9, 2024Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 11 months ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Jul 22, 2023Updated 2 years ago
- Official pytorch implementation for PSUMNet for efficient skeleton action recognition☆30Mar 17, 2023Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- ☆120Feb 19, 2024Updated 2 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆27Jan 3, 2023Updated 3 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Feb 3, 2023Updated 3 years ago
- Unified Negative Pair Generation toward Well-discriminative Feature Space for Face Recognition☆34Mar 14, 2022Updated 3 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134May 21, 2023Updated 2 years ago
- [ECCV 2024] Official code release for "Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition"☆41Mar 24, 2025Updated 11 months ago
- Repository of the paper "Reconstruction of Time-Varying Graph Signals via Sobolev Smoothness" published in IEEE T-SIPN☆13Mar 3, 2022Updated 3 years ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆37Jul 19, 2024Updated last year
- PyTorch implementation of "Detecting 32 Pedestrian Attributes for Autonomous Vehicles"☆33Oct 16, 2021Updated 4 years ago
- [ACM MM '24 Poster] Official repository of paper titled "Towards Robustness Prompt Tuning with Fully Test-Time Adaptation for CLIP’s Zero…☆10Aug 6, 2024Updated last year
- Code for "Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via Optimization Trajectory Distillation", ICCV 2023☆16Aug 31, 2023Updated 2 years ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆149Aug 21, 2024Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆39Aug 16, 2023Updated 2 years ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆38Sep 27, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 11 months ago
- The second generation of YOWO action detector.☆276May 9, 2024Updated last year