Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
☆24Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for MSQNet
Users that are interested in MSQNet are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆90Feb 25, 2025Updated last year
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆114Aug 3, 2023Updated 2 years ago
- MoSeq2 Jupyter Notebook platform used to run all of the MoSeq2 tools in a GUI.☆40Feb 18, 2026Updated last month
- [AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"☆21Sep 30, 2025Updated 5 months ago
- Github repo for referring atomic video action recognition☆20Oct 2, 2024Updated last year
- 🏆 The 1st Place Solution for AICity2022 Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆12Jun 28, 2022Updated 3 years ago
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆19Mar 13, 2024Updated 2 years ago
- ☆13Sep 2, 2023Updated 2 years ago
- Qt/Qml application using Google speech-to-text API to make voice commands☆11Jan 19, 2020Updated 6 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Oct 19, 2022Updated 3 years ago
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆78Mar 7, 2024Updated 2 years ago
- [CVPR 2024] Official code for paper: Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection.☆26Aug 19, 2024Updated last year
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆154Sep 9, 2024Updated last year
- ☆119Feb 19, 2024Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- ☆14Sep 2, 2020Updated 5 years ago
- 【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective☆198May 30, 2024Updated last year
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆39Aug 16, 2023Updated 2 years ago
- Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection" (IEEE-TIP)☆103Aug 23, 2024Updated last year
- CVPR2023☆18Mar 18, 2023Updated 3 years ago
- [ACM MM '24 Poster] Official repository of paper titled "Towards Robustness Prompt Tuning with Fully Test-Time Adaptation for CLIP’s Zero…☆10Aug 6, 2024Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆35Mar 24, 2025Updated 11 months ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆103Nov 6, 2024Updated last year
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?☆185May 22, 2024Updated last year
- ☆182Aug 20, 2022Updated 3 years ago
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts☆14Jan 13, 2025Updated last year
- Efficient and general implementation of Generalized Mean Pooling (GeM)☆11Jul 2, 2024Updated last year
- Official source code of the paper: Perturbation Seeking Generative Adversarial Networks: A Defense Framework for Remote Sensing Image Sce…☆14Jan 6, 2022Updated 4 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 3 months ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Jul 22, 2023Updated 2 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆603Dec 6, 2023Updated 2 years ago
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…☆24Sep 9, 2025Updated 6 months ago
- [ECCV 2024] Official code release for "Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition"☆42Mar 24, 2025Updated 11 months ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆303Apr 3, 2024Updated last year