sail-sg / lorahubLinks

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

☆655

Alternatives and similar repositories for lorahub

Users that are interested in lorahub are comparing it to the libraries listed below

Sorting:

yule-BUAA / MergeLM
Codebase for Merging Language Models (ICML 2024)
☆853Updated last year
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆401Updated last year
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆510Updated last year
declare-lab / instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
☆548Updated last year
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆662Updated last year
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆520Updated 9 months ago
ContextualAI / HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
☆890Updated 3 weeks ago
Cohere-Labs-Community / parameter-efficient-moe
☆271Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆439Updated last year
likenneth / honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆552Updated 8 months ago
fanqiwan / FuseAI
FuseAI Project
☆583Updated 8 months ago
Guitaricet / relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆465Updated last year
princeton-nlp / LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆631Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
sangmichaelxie / doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
☆341Updated last year
Shark-NLP / OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
☆575Updated 2 years ago
TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆377Updated last year
p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆261Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆477Updated last year
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆571Updated 10 months ago
xfactlab / orpo
Official repository for ORPO
☆463Updated last year
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆314Updated last year
pjlab-sys4nlp / llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
☆995Updated 10 months ago
suzgunmirac / BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
☆519Updated last year
google-research / distilling-step-by-step
☆560Updated 2 years ago
tatsu-lab / alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆825Updated last year
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆747Updated last year
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆923Updated 8 months ago
kmeng01 / memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆519Updated last year
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆675Updated 3 months ago