geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆36Updated 4 years ago
Alternatives and similar repositories for minGPT:
Users that are interested in minGPT are comparing it to the libraries listed below
- openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Co…☆53Updated 3 years ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- An implementation of delta-iris in tinygrad☆71Updated 5 months ago
- Scripts and environment for the tinybox☆92Updated 9 months ago
- Can RL solve simple problems?☆53Updated last year
- comma body does a loop around the office☆26Updated last year
- Letting computers listen to you and really care☆369Updated 2 years ago
- ☆22Updated last year
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆142Updated last year
- Open-source simulator for autonomous driving research.☆22Updated 4 years ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- ☆18Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- A minimalist neural networks library built on a tiny autograd engine☆17Updated 7 months ago
- a writeup on some experiments on a sequence model for chess games☆28Updated 3 years ago
- ☆27Updated 6 months ago
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated 8 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆81Updated last year
- Extensive introductory writeup on Zig language functionalities☆10Updated 6 months ago
- A single notebook for fine-tuning GPT-3.5 turbo☆31Updated 5 months ago
- The history files when recording human interaction while solving ARC tasks☆96Updated this week
- could we make an ml stack in 100,000 lines of code?☆30Updated 6 months ago
- Stream of my favorite papers and links☆40Updated 4 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆24Updated 3 weeks ago
- Helper scripts and examples for exploring the Falcon LLM models☆173Updated last year
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆46Updated 7 months ago
- ☆71Updated last year
- ☆62Updated this week
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year