huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆490 Updated last year
Alternatives and similar repositories for llm_training_handbook
Users interested in llm_training_handbook compare it to the repositories listed below.
- An open collection of implementation tips, tricks and resources for training large language models ☆472 Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 Updated last year
- distributed trainer for LLMs ☆575 Updated 11 months ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,484 Updated last year
- Scaling Data-Constrained Language Models ☆334 Updated 7 months ago
- Generate textbook-quality synthetic LLM pretraining data ☆498 Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆810 Updated 10 months ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆546 Updated last year
- Official repository for ORPO ☆452 Updated 11 months ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,119 Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆698 Updated last year
- ☆515 Updated 5 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆587 Updated last year
- Generative Representational Instruction Tuning ☆628 Updated 2 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆461 Updated last year
- Build, evaluate, understand, and fix LLM-based apps ☆488 Updated last year
- A curated list of awesome instruction tuning datasets, models, papers and repositories. ☆333 Updated last year
- A bagel, with everything. ☆320 Updated last year
- Official repository for LongChat and LongEval ☆519 Updated 11 months ago
- A repository for research on medium sized language models. ☆495 Updated last week
- A framework for the evaluation of autoregressive code generation language models. ☆943 Updated 6 months ago
- batched loras ☆342 Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆301 Updated last year
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆608 Updated last year
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding ☆1,248 Updated 2 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆945 Updated 6 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ☆464 Updated 3 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371 Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,381 Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆250 Updated last year