MNoorFawi / curlora

The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.

☆44

Alternatives and similar repositories for curlora

Users who are interested in curlora are comparing it to the repositories listed below.

kyleliang919 / Online-Subspace-Descent
This repo is based on https://github.com/jiaweizzhao/GaLore
☆29 Updated 9 months ago
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆40 Updated 8 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57 Updated 9 months ago
NolanoOrg / SpectraSuite
☆48 Updated 11 months ago
samchaineau / llm_slerp_generation
Repo hosting codes and materials related to speeding LLMs' inference using token merging.
☆36 Updated last year
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26 Updated 4 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 Updated 10 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆77 Updated last year
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆39 Updated 3 months ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆33 Updated 3 months ago
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆126 Updated 6 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆73 Updated last month
EternityYW / Gemini-Commonsense-Evaluation
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
☆36 Updated last year
vis-nlp / ChartGemma
☆62 Updated 11 months ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39 Updated 7 months ago
katiekang1998 / reasoning_generalization
☆32 Updated 5 months ago
convergence-ai / lm2
Official repo of paper LM2
☆41 Updated 4 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86 Updated last year
jadechip / nanoXLSTM
The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41 Updated last year
RobertCsordas / moeut
☆79 Updated 10 months ago
Infini-AI-Lab / S2FT
☆17 Updated 5 months ago
allenai / infinigram-api
☆61 Updated 3 weeks ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33 Updated last year
arcee-ai / DAM
☆51 Updated 7 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆20 Updated 6 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated last month
devvrit / matformer
MatFormer repo
☆31 Updated 6 months ago
snu-mllab / Context-Memory
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆61 Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆111 Updated 4 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45 Updated 11 months ago