evanmiller / LLM-Reading-ListLinks

LLM papers I'm reading, mostly on inference and model compression

☆746

Alternatives and similar repositories for LLM-Reading-List

Users that are interested in LLM-Reading-List are comparing it to the libraries listed below

Sorting:

srush / LLM-Training-Puzzles
What would you do with 1000 H100s...
☆1,128Updated last year
bkitano / llama-from-scratch
Llama from scratch, or How to implement a paper without crying
☆580Updated last year
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆826Updated 3 months ago
srush / Transformer-Puzzles
Puzzles for exploring transformers
☆376Updated 2 years ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆715Updated 2 years ago
SumanthRH / tokenization
A comprehensive deep dive into the world of tokens
☆227Updated last year
apoorvumang / prompt-lookup-decoding
☆577Updated last year
gpu-mode / awesomeMLSys
An ML Systems Onboarding list
☆942Updated 10 months ago
explosion / curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
☆893Updated last year
carlini / yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
☆1,034Updated 6 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆170Updated 5 months ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,014Updated last year
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆352Updated last year
mistralai / megablocks-public
☆863Updated last year
arpitingle / gpu-alpha
High Quality Resources on GPU Programming/Architecture
☆590Updated last year
linjames0 / Transformer-CUDA
An implementation of the transformer architecture onto an Nvidia CUDA kernel
☆195Updated 2 years ago
gpu-mode / resource-stream
GPU programming related news and material links
☆1,795Updated 2 months ago
sabetAI / BLoRA
batched loras
☆348Updated 2 years ago
SkunkworksAI / hydra-moe
☆415Updated 2 years ago
HazyResearch / aisys-building-blocks
Building blocks for foundation models.
☆579Updated last year
predibase / llm_distillation_playbook
Best practices for distilling large language models.
☆587Updated last year
eugeneyan / llm-paper-notes
Notes from the Latent Space paper club. Follow along or start your own!
☆241Updated last year
srush / Autodiff-Puzzles
☆460Updated last year
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆606Updated last year
rwitten / HighPerfLLMs2024
☆546Updated last year
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆653Updated last year
Laz4rz / GPT-2
Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172Updated last year
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆396Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆257Updated 2 years ago
sangmichaelxie / cs324_p2
Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆105Updated 2 years ago