geohot / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆39Updated 5 years ago
Alternatives and similar repositories for minGPT
Users that are interested in minGPT are comparing it to the libraries listed below
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆143Updated 2 years ago
- ☆96Updated last year
- A really tiny autograd engine☆95Updated 3 months ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆57Updated 2 years ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Enabling tinygrad compatibility with the Google Edge TPU☆79Updated 11 months ago
- Torturing neural networks by forcing them to learn the Mandelbrot set.☆169Updated 5 months ago
- An implementation of delta-iris in tinygrad☆72Updated last year
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Letting computers listen to you and really care☆369Updated 3 years ago
- Some ipython notebooks implementing AI algorithms☆1,369Updated 3 months ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 11 months ago
- Like picoGPT but for BERT.☆50Updated 2 years ago
- ☆111Updated last year
- Building Andrej Karpathy's micrograd from scratch☆40Updated 2 years ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX.☆68Updated 6 months ago
- Some helpers and examples for creating an LLM fine-tuning dataset☆73Updated last year
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆22Updated 2 years ago
- ☆50Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- ☆39Updated last year
- Helper scripts and examples for exploring the Falcon LLM models☆174Updated 2 years ago
- ☆71Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- Graphical Code Tracer (GCT): Visualize code at lightning speed☆53Updated last year
- LLM training in simple, raw C/CUDA☆18Updated last year
- My own repository containing the code I wrote to practice CUDA programming.☆48Updated 2 years ago
- ☆166Updated 2 years ago
- Various handy scripts to quickly set up new Linux and Windows sandboxes, containers, and WSL.☆40Updated 4 months ago