karpathy / nanoGPTLinks

The simplest, fastest repository for training/finetuning medium-sized GPTs.

☆50,264

Alternatives and similar repositories for nanoGPT

Users that are interested in nanoGPT are comparing it to the libraries listed below

Sorting:

karpathy / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆23,024Updated last year
karpathy / llama2.c
Inference Llama 2 in one file of pure C
☆18,988Updated last year
ggml-org / llama.cpp
LLM inference in C/C++
☆90,508Updated this week
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆13,010Updated 11 months ago
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆20,157Updated last week
openai / tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
☆16,670Updated last month
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆10,183Updated last year
karpathy / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆4,584Updated last year
run-llama / llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
☆45,533Updated this week
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆28,257Updated 5 months ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,983Updated last year
ggml-org / ggml
Tensor library for machine learning
☆13,648Updated last week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,778Updated last year
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,236Updated last year
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,287Updated 6 months ago
meta-llama / llama
Inference code for Llama models
☆58,948Updated 10 months ago
karpathy / micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
☆13,900Updated last year
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,691Updated last week
huggingface / trl
Train transformer language models with reinforcement learning.
☆16,473Updated this week
karpathy / makemore
An autoregressive character-level language model for making more things
☆3,476Updated last year
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,684Updated 2 weeks ago
openai / evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
☆17,379Updated last month
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆9,786Updated 3 months ago
karpathy / ng-video-lecture
☆4,368Updated last year
jaymody / picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
☆3,423Updated 2 years ago
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆18,061Updated last month
langchain-ai / langchain
🦜🔗 The platform for reliable agents.
☆120,580Updated last week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,975Updated last week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆20,804Updated last week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,790Updated last week