predibase / lora_bakeoff
☆18Updated 6 months ago
Alternatives and similar repositories for lora_bakeoff:
Users that are interested in lora_bakeoff are comparing it to the libraries listed below
- ☆48Updated 4 months ago
- ☆32Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆39Updated 5 months ago
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆26Updated last month
- ☆43Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆43Updated 7 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆26Updated 6 months ago
- ☆27Updated 4 months ago
- Mixtral finetuning☆19Updated last year
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆27Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆85Updated last week
- LLM training in simple, raw C/CUDA☆14Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- ☆50Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- ☆24Updated last year
- The repository contains code for Adaptive Data Optimization☆20Updated 3 months ago
- Make triton easier☆47Updated 9 months ago
- ☆47Updated 7 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 4 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- ☆50Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 weeks ago
- ☆74Updated 7 months ago
- Aioli: A unified optimization framework for language model data mixing☆22Updated 2 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆63Updated 3 months ago