PRIS-CV / Category-Specific-PromptLinks
Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"
☆14Updated last year
Alternatives and similar repositories for Category-Specific-Prompt
Users that are interested in Category-Specific-Prompt are comparing it to the libraries listed below
Sorting:
- ☆26Updated 2 years ago
- Official implementation of TagAlign☆35Updated last year
- ☆22Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆97Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Updated last year
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21Updated 8 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Updated last year
- ☆85Updated 2 years ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆18Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Updated 2 years ago
- ☆18Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆76Updated 2 years ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆30Updated 2 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆69Updated last year
- ☆58Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Semi-Supervised Temporal Action Detection with Proposal-Free Masking "☆21Updated 2 years ago
- RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.☆13Updated 2 years ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆69Updated last year
- ☆17Updated 11 months ago
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆32Updated 9 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated 2 years ago
- Generating Image Specific Text☆29Updated 2 years ago
- ☆83Updated last year
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆63Updated 2 months ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Updated 2 years ago
- Turning to Video for Transcript Sorting☆49Updated 2 years ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆49Updated last week
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Updated 6 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆27Updated 4 months ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Updated 2 years ago