cambridgeltl/composable-sft

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cambridgeltl/composable-sft)

cambridgeltl / composable-sft

A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.

☆75

Alternatives and similar repositories for composable-sft

Users that are interested in composable-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cindyxinyiwang / emea
View on GitHub
☆13Dec 11, 2021Updated 4 years ago
UKPLab / adaptable-adapters
View on GitHub
☆25Jul 12, 2022Updated 4 years ago
bigscience-workshop / multilingual-modeling
View on GitHub
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆75Mar 2, 2024Updated 2 years ago
thevasudevgupta / transformers-adapters
View on GitHub
This repositary hosts my experiments for the project, I did with OffNote Labs.
☆10Apr 12, 2021Updated 5 years ago
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cbaziotis / lm-prior-for-nmt
View on GitHub
This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"
☆43Mar 16, 2021Updated 5 years ago
morningmoni / UniPELT
View on GitHub
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆64Mar 23, 2022Updated 4 years ago
McGill-NLP / polytropon
View on GitHub
☆54May 8, 2023Updated 3 years ago
yang-zhang / labse-pytorch
View on GitHub
Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model
☆21Sep 2, 2020Updated 5 years ago
dguo98 / DiffPruning
View on GitHub
Parameter Efficient Transfer Learning with Diff Pruning
☆74Feb 3, 2021Updated 5 years ago
Yaoming95 / CIAT
View on GitHub
code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation
☆18Oct 19, 2022Updated 3 years ago
georgepar / optimistic-adam
View on GitHub
PyTorch implementation of Optimistic Adam proposed in Training GANs with Optimism (https://arxiv.org/pdf/1711.00141.pdf)
☆20Jan 16, 2021Updated 5 years ago
alexandra-chron / lexical_xlm_relm
View on GitHub
PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…
☆18Oct 18, 2022Updated 3 years ago
wenlai-lavine / m4Adapter
View on GitHub
m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)
☆19Mar 28, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mrmaheshrajput / productionizing-llms
View on GitHub
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆12Mar 27, 2024Updated 2 years ago
Abonia1 / yolosegment2labelme
View on GitHub
yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…
☆10May 8, 2024Updated 2 years ago
dki-lab / few-shot-bioIE
View on GitHub
True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning
☆12Jul 6, 2022Updated 4 years ago
StonyBrookNLP / teabreac
View on GitHub
Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22
☆19Jun 23, 2023Updated 3 years ago
xlhex / dpe
View on GitHub
☆22Oct 26, 2020Updated 5 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29Apr 2, 2022Updated 4 years ago
ShaojieJiang / tldr
View on GitHub
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10Aug 11, 2023Updated 2 years ago
naoya-i / r4c
View on GitHub
r4c
☆14Mar 2, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Tikquuss / meta_XLM
View on GitHub
Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks
☆20Mar 26, 2021Updated 5 years ago
GURPREETKAURJETHRA / Multi-Agent-AI-App
View on GitHub
Multi-Agent AI App from Scratch in python without any depedency of framework
☆15Jan 7, 2025Updated last year
McGill-NLP / retriever-lm-reasoning
View on GitHub
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Nov 2, 2023Updated 2 years ago
ictnlp / PTE-NMT
View on GitHub
Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation
☆15Jul 19, 2021Updated 5 years ago
UniversalDependencies / UD_Polish-PDB
View on GitHub
Polish data.
☆13May 6, 2026Updated 2 months ago
Abonia1 / Fine-Tuning-LLMs-Key-Concepts-and-Terms
View on GitHub
Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…
☆13Sep 19, 2024Updated last year
jungokasai / beam_with_patience
View on GitHub
☆46Apr 13, 2022Updated 4 years ago
lorelupo / divide-and-rule
View on GitHub
☆12Oct 17, 2022Updated 3 years ago
ThomasScialom / T0_continual_learning
View on GitHub
Adding new tasks to T0 without catastrophic forgetting
☆33Oct 20, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
oscar-defelice / TimeSeries-lectures
View on GitHub
This is a series of notebooks to support lectures on Time series analysis and forecast for a course I held in a master postgraduate progr…
☆15Nov 29, 2022Updated 3 years ago
INK-USC / CrossFit
View on GitHub
Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)
☆113Apr 28, 2022Updated 4 years ago
qiuzh20 / EMoE
View on GitHub
Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]
☆39May 28, 2024Updated 2 years ago
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago
AlanAnsell / peft
View on GitHub
☆22Jul 5, 2024Updated 2 years ago
Shwai-He / MEO
View on GitHub
The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":
☆47Feb 28, 2026Updated 5 months ago
shtoshni / span-rep
View on GitHub
Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"
☆21Nov 4, 2022Updated 3 years ago