A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Aug 9, 2024Updated last year
Alternatives and similar repositories for composable-sft
Users that are interested in composable-sft are comparing it to the libraries listed below
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Jan 14, 2022Updated 4 years ago
- ☆13Dec 11, 2021Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated last year
- ☆46Apr 13, 2022Updated 3 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆63Mar 23, 2022Updated 3 years ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate …☆17Jan 13, 2025Updated last year
- Multi-Agent AI app built from scratch in Python without any framework dependency☆15Jan 7, 2025Updated last year
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- This repository hosts my experiments for the project I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- ☆25Jul 12, 2022Updated 3 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Sep 19, 2024Updated last year
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- Minute-long video generation at 24FPS.☆55Feb 2, 2026Updated last month
- Collect papers about Mamba (a selective state space model).☆14Aug 6, 2024Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020☆29Jan 6, 2022Updated 4 years ago
- ☆54May 8, 2023Updated 2 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- ☆14May 10, 2021Updated 4 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- Fork of diux-dev/imagenet18☆16Oct 4, 2018Updated 7 years ago
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors", which was published at AAAI …☆15Mar 25, 2020Updated 5 years ago
- Securing LLMs Against Top 10 OWASP Large Language Model Vulnerabilities 2024☆22May 10, 2024Updated last year
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- An intelligent agent utilizing Large Language Models (LLMs) for automated financial news retrieval and stock price prediction.☆21Sep 9, 2024Updated last year
- Code for the TACL paper "Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings"☆16Sep 8, 2020Updated 5 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Mar 16, 2021Updated 4 years ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆39May 28, 2024Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- Code for paper "CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆113Apr 28, 2022Updated 3 years ago
- Code repo for the EMNLP'21 Findings paper "Counter-Interference Adapter for Multilingual Machine Translation"☆18Oct 19, 2022Updated 3 years ago
- A simple algorithm to identify and correct for label shift.☆21Feb 4, 2018Updated 8 years ago
- ☆77Apr 29, 2024Updated last year
- Ongoing research training transformer models at scale☆18Jul 27, 2023Updated 2 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆75Feb 3, 2021Updated 5 years ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts" (EMNLP 2023)☆44Feb 28, 2026Updated last week