cs-mshah / Adapter-BertLinks
Paper Implementation for "Parameter-Efficient Transfer Learning for NLP"
☆15Updated last year
Alternatives and similar repositories for Adapter-Bert
Users that are interested in Adapter-Bert are comparing it to the libraries listed below
Sorting:
- ☆69Updated 3 years ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆85Updated 6 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆72Updated 2 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆102Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆83Updated 7 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆143Updated last year
- ☆139Updated last month
- ☆137Updated 11 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Updated last year
- A Survey on Data Selection for Language Models☆237Updated last month
- ☆155Updated 3 years ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆45Updated 8 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆32Updated last year
- ☆183Updated last year
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆75Updated last year
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".☆18Updated 2 years ago
- ☆44Updated 3 months ago
- awesome SAE papers☆35Updated last month
- This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).☆46Updated 2 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆112Updated last year
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆142Updated 2 years ago
- ☆13Updated last year
- The Paper List on Data Contamination for Large Language Models Evaluation.☆95Updated 2 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated last year
- ☆172Updated last year
- ☆129Updated 2 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆59Updated 3 months ago
- A curated list of Model Merging methods.☆92Updated 9 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆102Updated 3 months ago
- ☆49Updated last year