PRIS-CV / Category-Specific-PromptLinks
Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"
☆12Updated last year
Alternatives and similar repositories for Category-Specific-Prompt
Users that are interested in Category-Specific-Prompt are comparing it to the libraries listed below
Sorting:
- ☆19Updated 8 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆25Updated 2 weeks ago
- ☆23Updated 2 years ago
- Official implementation of TagAlign☆35Updated 7 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆83Updated 5 months ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆67Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆35Updated 2 months ago
- ☆58Updated last year
- ☆117Updated last year
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Updated last year
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆13Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆30Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆53Updated last year
- Turning to Video for Transcript Sorting☆48Updated last year
- [ICCV 2023] PyTorch implementation of RandBox☆53Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- ☆30Updated last year
- ☆14Updated 4 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated 11 months ago
- Tracking with Human-Intent Reasoning☆71Updated 8 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆76Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆37Updated last year
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆21Updated 5 months ago
- Official code of ACM MM2024 paper- Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection☆23Updated 10 months ago
- Video Feature Enhancement with PyTorch☆31Updated 7 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆31Updated 7 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 8 months ago