geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆40Updated 5 years ago
Alternatives and similar repositories for minGPT
Users who are interested in minGPT are comparing it to the repositories listed below.
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆143Updated 2 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆58Updated 2 years ago
- Letting computers listen to you and really care☆371Updated 3 years ago
- Enabling tinygrad compatibility with the Google Edge TPU☆85Updated last year
- ☆96Updated last year
- ☆70Updated last year
- Helper scripts and examples for exploring the Falcon LLM models☆174Updated 2 years ago
- Some helpers and examples for creating an LLM fine-tuning dataset☆74Updated last year
- Scripts and environment for the tinybox☆94Updated last year
- ☆144Updated 2 years ago
- ☆112Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- A really tiny autograd engine☆95Updated 5 months ago
- Alex Krizhevsky's original code from Google Code☆199Updated 9 years ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆133Updated last week
- Creating and Using an Open Assistant API locally (Pythia 12B GPT model)☆76Updated 2 years ago
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆21Updated 2 years ago
- If tinygrad wasn't small enough for you...☆743Updated last year
- ☆127Updated 7 months ago
- Machine translation with tinygrad☆18Updated last year
- ☆138Updated last year
- ☆46Updated 2 years ago
- Inference Llama 2 in one file of pure Python☆422Updated last year
- A dataset of alignment research and code to reproduce it☆78Updated 2 years ago
- Visualizing some of the internals of a neural network during training and inference.☆77Updated last year
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- parallelized hyperdimensional tictactoe☆125Updated last year
- A simple wrapper to call openAI's chatGPT on the terminal written in Python☆80Updated 8 months ago
- This repository contains a simple llama3 implementation in pure JAX.☆70Updated 8 months ago