amazon-science / peft-design-spaces
Official implementation for "Parameter-Efficient Fine-Tuning Design Spaces"
β26Updated last year
Related projects β
Alternatives and complementary repositories for peft-design-spaces
- 𧬠RegMix: Data Mixture as Regression for Language Model Pre-trainingβ88Updated last month
- β46Updated 2 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Modelsβ43Updated 5 months ago
- Touchstone: Evaluating Vision-Language Models by Language Modelsβ78Updated 10 months ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuningβ29Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Modelsβ73Updated 8 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversationβ48Updated last year
- β84Updated 11 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ146Updated 5 months ago
- β38Updated 5 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"β73Updated 9 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learningβ38Updated last year
- β88Updated last month
- Source code for the paper "Prefix Language Models are Unified Modal Learners"β43Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":β35Updated 7 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ36Updated 8 months ago
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memoriβ¦β45Updated last year
- Released code for our ICLR23 paper.β63Updated last year
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022β58Updated 2 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteriaβ55Updated last month
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".β32Updated last year
- Official completion of βTraining on the Benchmark Is Not All You Needβ.β26Updated 2 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Modelsβ67Updated 4 months ago
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.β69Updated last month
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMsβ75Updated last month
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Modelsβ69Updated 8 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.β51Updated 3 weeks ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)β29Updated 7 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language modelβ25Updated last week
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"β36Updated last year