geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆40Updated 5 years ago
Alternatives and similar repositories for minGPT
Users that are interested in minGPT are comparing it to the libraries listed below
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆144Updated 2 years ago
- Letting computers listen to you and really care☆371Updated 3 years ago
- A really tiny autograd engine☆96Updated 6 months ago
- Scripts and environment for the tinybox☆95Updated last year
- Alex Krizhevsky's original code from Google Code☆198Updated 9 years ago
- My own repository containing the code I wrote to practice CUDA programming.☆64Updated 2 years ago
- Helpers and such for working with Lambda Cloud☆51Updated 2 years ago
- An implementation of delta-iris in tinygrad☆72Updated last year
- Torturing neural networks by forcing them to learn the Mandelbrot set.☆172Updated 9 months ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆58Updated 2 years ago
- If tinygrad wasn't small enough for you...☆759Updated last year
- ☆112Updated 2 years ago
- Enabling tinygrad compatibility with the Google Edge TPU☆85Updated last year
- Helper scripts and examples for exploring the Falcon LLM models☆174Updated 2 years ago
- Solve puzzles to improve your tinygrad skills!☆164Updated last month
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- ☆96Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆49Updated 4 years ago
- Tutorials on tinygrad☆444Updated 2 months ago
- Can RL solve simple problems?☆54Updated last year
- Small autograd engine inspired by Karpathy's micrograd and PyTorch☆277Updated last year
- ☆40Updated last year
- ☆139Updated 2 years ago
- This repository contains a simple Llama 3 implementation in pure JAX.☆70Updated 9 months ago
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆22Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- A simple wrapper, written in Python, to call OpenAI's ChatGPT from the terminal☆80Updated 9 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆53Updated last year
- ☆97Updated this week