[NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
☆90Nov 28, 2023Updated 2 years ago
Alternatives and similar repositories for Aurora
Users that are interested in Aurora are comparing it to the libraries listed below
Sorting:
- ☆26Mar 20, 2023Updated 2 years ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆50May 12, 2024Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.☆412Sep 26, 2024Updated last year
- Log-Polar Space Convolution for Convolutional Neural Networks☆13Dec 12, 2022Updated 3 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- Multi-head Recurrent Layer Attention for Vision Network☆22Mar 2, 2023Updated 3 years ago
- [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"☆102Apr 10, 2024Updated last year
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆71Jan 19, 2024Updated 2 years ago
- ☆34Nov 12, 2023Updated 2 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Nov 4, 2023Updated 2 years ago
- [ICLR 2023] Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning"☆60Jun 6, 2023Updated 2 years ago
- ☆61May 2, 2025Updated 10 months ago
- [Pattern Recognition 2025] Cross-Modal Adapter for Vision-Language Retrieval☆140Aug 17, 2025Updated 6 months ago
- ☆22Dec 9, 2022Updated 3 years ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆806Jul 24, 2023Updated 2 years ago
- (IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated 11 months ago
- A much powerful probing method to tune your model with promising performance and linear probing training cost!☆15Jul 26, 2023Updated 2 years ago
- This is a repository for the ICLR2023 accepted paper -- Medical Image Understanding with Pretrained Vision Language Models: A Comprehensi…☆71Jun 9, 2023Updated 2 years ago
- ☆34Aug 23, 2023Updated 2 years ago
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated 10 months ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆46Mar 10, 2023Updated 2 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- Awesome Self-Supervised Vision Learning☆11Mar 27, 2024Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- Demo scripts for HPS Dataset (http://virtualhumans.mpi-inf.mpg.de/hps/)☆11Mar 10, 2025Updated 11 months ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- The efficient tuning method for VLMs☆81Mar 10, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- [NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"☆525Jan 27, 2024Updated 2 years ago
- The offical implemention of JM3D.☆31Aug 18, 2025Updated 6 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆53Sep 21, 2023Updated 2 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆347Dec 14, 2025Updated 2 months ago
- ☆55Dec 13, 2023Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆85Mar 21, 2024Updated last year
- ☆24Jun 18, 2025Updated 8 months ago