This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).
☆138Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for AdaMix
Users that are interested in AdaMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024)☆51Mar 17, 2024Updated 2 years ago
- ☆15Oct 30, 2021Updated 4 years ago
- This package implements THOR: Transformer with Stochastic Experts.☆64Oct 7, 2021Updated 4 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆64Mar 23, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆239Dec 8, 2023Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆456Sep 6, 2023Updated 2 years ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆204May 4, 2024Updated 2 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆104Dec 1, 2022Updated 3 years ago
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…☆52Oct 8, 2023Updated 2 years ago
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆544Mar 24, 2022Updated 4 years ago
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆242Jan 20, 2023Updated 3 years ago
- Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".☆28Nov 23, 2020Updated 5 years ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Aug 14, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Jul 1, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆93Jul 25, 2023Updated 2 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,812Apr 26, 2026Updated last week
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆130Apr 23, 2022Updated 4 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- lecture video repository with some comments or time stamps☆16Jun 8, 2019Updated 6 years ago
- ☆67May 18, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆35Mar 2, 2023Updated 3 years ago
- ☆131Aug 18, 2022Updated 3 years ago
- ☆277Oct 31, 2023Updated 2 years ago
- ☆34Aug 5, 2023Updated 2 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- ☆13Feb 2, 2023Updated 3 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- ☆177Jul 24, 2024Updated last year
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆940Oct 6, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Codes for paper: Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT☆34May 27, 2022Updated 3 years ago
- Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NL…☆18May 4, 2022Updated 4 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,042Sep 19, 2024Updated last year
- ☆11Jan 3, 2023Updated 3 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆474Apr 21, 2024Updated 2 years ago