huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆545 Updated last year
Alternatives and similar repositories for llm_training_handbook
Users who are interested in llm_training_handbook are comparing it to the libraries listed below
- An open collection of implementation tips, tricks and resources for training large language models ☆490 Updated 2 years ago
- ☆559 Updated last year
- A repository for research on medium sized language models. ☆524 Updated 6 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆631 Updated last year
- LLM Workshop by Sourab Mangrulkar ☆399 Updated last year
- distributed trainer for LLMs ☆587 Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆733 Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆551 Updated last year
- Scaling Data-Constrained Language Models ☆343 Updated 6 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆259 Updated 2 years ago
- Evaluation suite for LLMs ☆376 Updated 5 months ago
- Build, evaluate, understand, and fix LLM-based apps ☆492 Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆316 Updated 2 years ago
- batched loras ☆347 Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data ☆508 Updated 2 years ago
- Official repository for ORPO ☆468 Updated last year
- Generative Representational Instruction Tuning ☆681 Updated 6 months ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆317 Updated 2 years ago
- Pre-training code for Amber 7B LLM ☆170 Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆482 Updated last year
- Fast Inference Solutions for BLOOM ☆565 Updated last year
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,384 Updated last month
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆598 Updated 2 years ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning. ☆583 Updated 2 years ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆269 Updated 7 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371 Updated 2 years ago
- Scalable toolkit for efficient model alignment ☆847 Updated 2 months ago
- Repository for organizing datasets and papers used in Open LLM. ☆101 Updated 2 years ago
- A bagel, with everything. ☆325 Updated last year
- A curated list of awesome instruction tuning datasets, models, papers and repositories. ☆345 Updated 2 years ago