geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆40 Updated 5 years ago
Alternatives and similar repositories for minGPT
Users who are interested in minGPT are comparing it to the libraries listed below.
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor… ☆144 Updated 2 years ago
- openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Co… ☆56 Updated 4 years ago
- Letting computers listen to you and really care ☆372 Updated 3 years ago
- Some ipython notebooks implementing AI algorithms ☆1,394 Updated 7 months ago
- Some helpers and examples for creating an LLM fine-tuning dataset ☆74 Updated last year
- A really tiny autograd engine ☆98 Updated 7 months ago
- An implementation of delta-iris in tinygrad ☆72 Updated last year
- Torturing neural networks by forcing them to learn the Mandelbrot set. ☆173 Updated 10 months ago
- ☆96 Updated last year
- Reweight GPT - a simple neural network using transformer architecture for next character prediction ☆59 Updated 2 years ago
- Scripts and environment for the tinybox ☆95 Updated last year
- ☆112 Updated 2 years ago
- mergesort in many languages ☆264 Updated 2 years ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆218 Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi… ☆355 Updated last year
- ☆19 Updated 2 years ago
- Helper scripts and examples for exploring the Falcon LLM models ☆172 Updated 2 years ago
- Helpers and such for working with Lambda Cloud ☆51 Updated 2 years ago
- Can RL solve simple problems? ☆54 Updated 2 years ago
- ☆89 Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen ☆104 Updated 2 years ago
- ☆139 Updated 2 years ago
- My own repository containing the codes I wrote to practice CUDA programming. ☆64 Updated 2 years ago
- ☆71 Updated 2 years ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆116 Updated last year
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge) ☆47 Updated last year
- ☆127 Updated 9 months ago
- ☆97 Updated last week
- ☆144 Updated 2 years ago
- Creating and Using an Open Assistant API locally (Pythia 12B GPT model) ☆76 Updated 2 years ago