A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Aug 9, 2024Updated last year
Alternatives and similar repositories for composable-sft
Users that are interested in composable-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Jan 14, 2022Updated 4 years ago
- ☆13Dec 11, 2021Updated 4 years ago
- ☆25Jul 12, 2022Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- phone inventory library☆17May 15, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆43Mar 16, 2021Updated 5 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆64Mar 23, 2022Updated 4 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Oct 19, 2022Updated 3 years ago
- ☆54May 8, 2023Updated 3 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆74Feb 3, 2021Updated 5 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ongoing research training transformer models at scale☆18Jul 27, 2023Updated 2 years ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- ☆15May 10, 2021Updated 5 years ago
- 🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models [BabyLM Challenge]☆17Jan 10, 2025Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 4 years ago
- ☆22Oct 26, 2020Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Aug 31, 2021Updated 4 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆113Apr 28, 2022Updated 4 years ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆45Feb 28, 2026Updated 3 months ago
- ☆21Jul 5, 2024Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- Vietnamese Text Dataset - Wikipedia vi 2018☆15Feb 18, 2019Updated 7 years ago
- PyTorch code for NeurIPSW 2020 paper (4th Workshop on Meta-Learning) "Few-Shot Unsupervised Continual Learning through Meta-Examples"☆20Nov 2, 2020Updated 5 years ago
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Versatile Generative Language Model☆25Oct 29, 2022Updated 3 years ago
- [ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers☆26Jun 7, 2023Updated 2 years ago
- Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"☆21Nov 4, 2022Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Mar 22, 2021Updated 5 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida☆12Feb 10, 2022Updated 4 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago