bruceyo / V-PETL
Towards a Unified View on Visual Parameter-Efficient Transfer Learning
☆26Updated last year
Related projects:
- Turning to Video for Transcript Sorting☆44Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆41Updated 3 months ago
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆17Updated 3 weeks ago
- The official code of paper "Automated Multi-level Preference for MLLMs"☆15Updated 3 weeks ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆28Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆26Updated 2 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆36Updated 9 months ago
- 📍 Official PyTorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆45Updated 10 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆35Updated last month
- FreeVA: Offline MLLM as Training-Free Video Assistant☆42Updated 3 months ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆53Updated 10 months ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Updated 2 years ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆41Updated 4 months ago
- Compress conventional Vision-Language Pre-training data☆49Updated 11 months ago
- [CVPR2022] PyTorch re-implementation of Prompt Distribution Learning☆14Updated last year
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆54Updated last year
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆51Updated 11 months ago
- [ICLR 23] Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning☆36Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆23Updated 6 months ago
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆60Updated 11 months ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated 8 months ago