bkitano / llama-from-scratchLinks

Llama from scratch, or How to implement a paper without crying

☆580

Alternatives and similar repositories for llama-from-scratch

Users that are interested in llama-from-scratch are comparing it to the libraries listed below

Sorting:

pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆394Updated last year
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆818Updated 2 months ago
predibase / llm_distillation_playbook
Best practices for distilling large language models.
☆578Updated last year
hkproj / pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
☆354Updated 2 years ago
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,432Updated last year
SumanthRH / tokenization
A comprehensive deep dive into the world of tokens
☆226Updated last year
evanmiller / LLM-Reading-List
LLM papers I'm reading, mostly on inference and model compression
☆741Updated last year
srush / LLM-Training-Puzzles
What would you do with 1000 H100s...
☆1,110Updated last year
AviSoori1x / makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
☆751Updated 11 months ago
philschmid / deep-learning-pytorch-huggingface
☆1,300Updated 7 months ago
jsbaan / transformer-from-scratch
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
☆263Updated last year
huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆536Updated last year
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆165Updated 3 months ago
EurekaLabsAI / mlp
The Multilayer Perceptron Language Model
☆568Updated last year
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆211Updated last year
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated 2 years ago
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆494Updated this week
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated last year
mlfoundations / open_lm
A repository for research on medium sized language models.
☆512Updated 4 months ago
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆661Updated last year
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆126Updated 2 years ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,252Updated last month
xfactlab / orpo
Official repository for ORPO
☆463Updated last year
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,010Updated last year
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆916Updated 5 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆197Updated 7 months ago
EurekaLabsAI / tensor
The Tensor (or Array)
☆449Updated last year
ethanyanjiali / minChatGPT
A minimum example of aligning language models with RLHF similar to ChatGPT
☆223Updated 2 years ago
nomic-ai / contrastors
Train Models Contrastively in Pytorch
☆753Updated 6 months ago