geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆37Updated 4 years ago
Alternatives and similar repositories for minGPT:
Users that are interested in minGPT are comparing it to the libraries listed below
- tiny corporation website☆7Updated this week
- Scripts and environment for the tinybox☆93Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆49Updated 4 years ago
- Enabling tinygrad compatibility with the Google Edge TPU☆77Updated 7 months ago
- Implementation of Karpathy's micrograd in Mojo☆73Updated last year
- Some helpers and examples for creating an LLM fine-tuning dataset☆70Updated last year
- ☆38Updated last year
- Visualizing some of the internals of a neural network during training and inference.☆75Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Can RL solve simple problems?☆54Updated last year
- Like picoGPT but for BERT.☆49Updated 2 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆53Updated last year
- An implementation of delta-iris in tinygrad☆72Updated 8 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- comma body does a loop around the office☆26Updated last year
- Letting computers listen to you and really care☆370Updated 2 years ago
- Fastai implementation of @karpathy's miniGPT library☆15Updated 4 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 3 months ago
- ☆20Updated last year
- ☆27Updated 9 months ago
- ☆112Updated last year
- A single notebook for fine-tuning GPT-3.5 turbo☆32Updated 8 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆143Updated last year
- compression = AI☆53Updated 2 years ago
- ☆20Updated 6 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆129Updated 5 months ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆10Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆82Updated last year
- Code repository for Liquid Time-stochasticity networks (LTSs)☆22Updated last year