amazon-science / peft-design-spaces
Official implementation for "Parameter-Efficient Fine-Tuning Design Spaces"
☆26Updated 2 years ago
Alternatives and similar repositories for peft-design-spaces:
Users that are interested in peft-design-spaces are comparing it to the libraries listed below
- ☆45Updated 6 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆117Updated last month
- The code and data for the paper JiuZhang3.0☆43Updated 10 months ago
- Released code for our ICLR23 paper.☆64Updated 2 years ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆36Updated 11 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 9 months ago
- ☆61Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆39Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆46Updated 3 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆82Updated last year
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆68Updated 5 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning☆31Updated last year
- ☆49Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 8 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 8 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆24Updated 8 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆38Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆30Updated 3 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- ☆10Updated 10 months ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated last year
- ☆98Updated 6 months ago
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆45Updated 2 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆30Updated last year
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated last year