linziyi96 / st-adapterLinks
☆85Updated 2 years ago
Alternatives and similar repositories for st-adapter
Users that are interested in st-adapter are comparing it to the libraries listed below
Sorting:
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆127Updated 2 years ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆95Updated last year
- ☆42Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated 2 years ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆104Updated 2 years ago
- ☆49Updated 3 years ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆75Updated 2 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆78Updated last year
- ☆30Updated 2 years ago
- ☆120Updated last year
- ☆26Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆56Updated 2 years ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆55Updated 2 years ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆69Updated last year
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆64Updated last year
- ☆105Updated 2 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆106Updated 2 years ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆72Updated 2 years ago
- ☆95Updated 2 years ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆109Updated 2 months ago
- ☆39Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆51Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆50Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆175Updated 2 years ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆110Updated last year
- ☆62Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆97Updated 3 months ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆151Updated 2 years ago