A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Aug 9, 2024Updated last year
Alternatives and similar repositories for composable-sft
Users that are interested in composable-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Jan 14, 2022Updated 4 years ago
- ☆13Dec 11, 2021Updated 4 years ago
- ☆25Jul 12, 2022Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Mar 16, 2021Updated 5 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Oct 19, 2022Updated 3 years ago
- ☆54May 8, 2023Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆74Feb 3, 2021Updated 5 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Ongoing research training transformer models at scale☆18Jul 27, 2023Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Jun 23, 2023Updated 2 years ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆17Jan 9, 2025Updated last year
- ☆14May 10, 2021Updated 4 years ago
- 🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)☆17Jan 10, 2025Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- ☆22Oct 26, 2020Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆20Sep 27, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- ☆12Jun 23, 2023Updated 2 years ago
- ☆46Apr 13, 2022Updated 3 years ago
- Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020☆29Jan 6, 2022Updated 4 years ago
- Polish data.☆13Nov 12, 2025Updated 4 months ago
- ☆12Oct 17, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- r4c☆14Mar 2, 2021Updated 5 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Aug 31, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Visualizing the Stanford Encyclopedia of Philosophy☆22Oct 24, 2017Updated 8 years ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Feb 28, 2026Updated last month
- ☆14Mar 31, 2024Updated last year
- A tool to make spelling Thai more convenient☆11Mar 30, 2024Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- ☆77Apr 29, 2024Updated last year