mit-han-lab / offsite-tuningLinks

Offsite-Tuning: Transfer Learning without Full Model

☆374

Alternatives and similar repositories for offsite-tuning

Users that are interested in offsite-tuning are comparing it to the libraries listed below

Sorting:

sail-sg / lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆640Updated 11 months ago
JayZhang42 / FederatedGPT-Shepherd
Shepherd: A foundational framework enabling federated instruction tuning for large language models
☆237Updated 2 years ago
prateeky2806 / ties-merging
☆183Updated last year
Arnav0400 / ViT-Slim
Official code for our CVPR'22 paper “Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space”
☆250Updated last year
gstoica27 / ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆301Updated last year
p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆252Updated last year
calpt / awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning
☆194Updated last year
mlfoundations / task_vectors
Editing Models with Task Arithmetic
☆482Updated last year
huggingface / datablations
Scaling Data-Constrained Language Models
☆338Updated 2 weeks ago
QingruZhang / AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
☆336Updated 2 years ago
yxli2123 / LoftQ
☆223Updated last year
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92Updated 10 months ago
Cohere-Labs-Community / parameter-efficient-moe
☆266Updated last year
yuhuixu1993 / qa-lora
Official PyTorch implementation of QA-LoRA
☆138Updated last year
facebookresearch / SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…
☆138Updated last year
Shark-NLP / OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
☆569Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆427Updated last year
astramind-ai / Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆167Updated last year
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆170Updated last year
mlfoundations / model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆474Updated last year
benzakenelad / BitFit
Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
☆142Updated 2 years ago
microsoft / AdaMix
This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…
☆132Updated last year
lucidrains / speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
☆272Updated 6 months ago
locuslab / wanda
A simple and effective LLM pruning approach.
☆775Updated 11 months ago
CASE-Lab-UMD / LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
☆174Updated 3 months ago
VITA-Group / LiGO
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennige…
☆92Updated last year
OpenNLPLab / TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
☆247Updated last year
UIC-Liu-Lab / ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
☆282Updated last year
r-three / t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆452Updated last year
magic-research / Dataset_Quantization
[ICCV2023] Dataset Quantization
☆259Updated last year