A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Aug 9, 2024Updated last year
Alternatives and similar repositories for composable-sft
Users that are interested in composable-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Jan 14, 2022Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 3 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆64Mar 23, 2022Updated 4 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Oct 19, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆54May 8, 2023Updated 3 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆74Feb 3, 2021Updated 5 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- Ongoing research training transformer models at scale☆19Jul 27, 2023Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Jun 23, 2023Updated 2 years ago
- ☆15May 10, 2021Updated 5 years ago
- 🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models [BabyLM Challenge]☆17Jan 10, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆22Oct 26, 2020Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- ☆12Jun 23, 2023Updated 2 years ago
- Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation☆15Jul 19, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020☆29Jan 6, 2022Updated 4 years ago
- Polish data.☆13May 6, 2026Updated last month
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- r4c☆14Mar 2, 2021Updated 5 years ago
- ☆21Oct 31, 2022Updated 3 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆37Aug 31, 2021Updated 4 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆113Apr 28, 2022Updated 4 years ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆46Feb 28, 2026Updated 3 months ago
- ☆21Jul 5, 2024Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Versatile Generative Language Model☆25Oct 29, 2022Updated 3 years ago
- [ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers☆26Jun 7, 2023Updated 3 years ago
- Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"☆21Nov 4, 2022Updated 3 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida☆12Feb 10, 2022Updated 4 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Official repository of the R2-D2's pipeline☆21Nov 16, 2021Updated 4 years ago
- ☆11Jan 10, 2020Updated 6 years ago