mosaicml / examples

Fast and flexible reference benchmarks

☆435

Related projects: ⓘ

sabetAI / BLoRA
batched loras
☆326Updated last year
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆1,111Updated this week
Vahe1994 / SpQR
☆520Updated 8 months ago
huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆441Updated 7 months ago
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆455Updated last year
epfLLM / Megatron-LLM
distributed trainer for LLMs
☆521Updated 3 months ago
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆657Updated 5 months ago
mosaicml / streaming
A Data Streaming Library for Efficient Neural Network Training
☆1,076Updated this week
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆545Updated 2 months ago
databricks / megablocks
☆1,164Updated last week
SkunkworksAI / hydra-moe
☆409Updated 10 months ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆957Updated 3 weeks ago
lucidrains / RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆850Updated 10 months ago
hao-ai-lab / LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
☆1,099Updated 7 months ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆416Updated last year
huggingface / transformers-bloom-inference
Fast Inference Solutions for BLOOM
☆556Updated last month
srush / LLM-Training-Puzzles
What would you do with 1000 H100s...
☆816Updated 8 months ago
SqueezeAILab / SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆629Updated last month
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆625Updated 7 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆248Updated 10 months ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆469Updated last year
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in Under 4 Bits
☆696Updated 3 months ago
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆509Updated this week
punica-ai / punica
Serving multiple LoRA finetuned LLM as one
☆946Updated 4 months ago
zphang / minimal-llama
☆453Updated 11 months ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆300Updated last year
forhaoliu / ringattention
Transformers with Arbitrarily Large Context
☆613Updated last month
bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆972Updated last month
mlfoundations / open_lm
A repository for research on medium sized language models.
☆469Updated last month
IST-DASLab / sparsegpt
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
☆694Updated 3 weeks ago