ylsung / Ladder-Side-Tuning
PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
☆232Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Ladder-Side-Tuning
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆202Updated last year
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆516Updated 2 years ago
- Recent Advances in Vision and Language Pre-training (VLP)☆288Updated last year
- [NeurIPS'22] This is an official implementation for "Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning".☆173Updated last year
- All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)☆141Updated 2 months ago
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆222Updated 11 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆182Updated 7 months ago
- ☆152Updated 3 years ago
- MixGen: A New Multi-Modal Data Augmentation☆116Updated last year
- Dataset pruning for ImageNet and LAION-2B.☆69Updated 4 months ago
- SVIT: Scaling up Visual Instruction Tuning☆163Updated 5 months ago
- ☆147Updated 4 months ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆58Updated 2 years ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.☆145Updated 7 months ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆133Updated last year
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆260Updated 10 months ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.☆391Updated last month
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))☆87Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆29Updated 7 months ago
- METER: A Multimodal End-to-end TransformER Framework☆362Updated 2 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆130Updated 2 years ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆72Updated 7 months ago
- Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".☆83Updated 2 years ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆79Updated 9 months ago
- ☆76Updated 4 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆171Updated last year
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆192Updated last year
- A RLHF Infrastructure for Vision-Language Models☆104Updated this week
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆260Updated last month