Antimatter543 / karpathy-NN-lecturesLinks
My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and learnt ML techniques.
☆11Updated 2 years ago
Alternatives and similar repositories for karpathy-NN-lectures
Users that are interested in karpathy-NN-lectures are comparing it to the libraries listed below
Sorting:
- Simple Transformer in Jax☆139Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆24Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- Like picoGPT but for BERT.☆50Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆156Updated 2 years ago
- ChatGPT Plugin to Semantically Search Google Maps☆44Updated 2 years ago
- ☆96Updated last year
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Grounding LLM mathematical reasoning with proof assistants.☆63Updated 2 years ago
- ☆22Updated 2 years ago
- A really tiny autograd engine☆95Updated 4 months ago
- Sparse autoencoders for Contra text embedding models☆25Updated last year
- ☆24Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- Graphical Code Tracer (GCT): Visualize code at lightning speed☆53Updated last year
- smolbox of recipies☆28Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 11 months ago
- An introduction to LLM Sampling☆79Updated 9 months ago
- Mention any three favourite things and get recommendations in the form of a flow chart by Claude Haiku.☆13Updated last year
- The history files when recording human interaction while solving ARC tasks☆116Updated this week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated 2 years ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆277Updated 10 months ago
- ☆44Updated 3 months ago
- a tiny vectorstore implementation built with numpy.☆63Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- compute, storage, and networking infra at home☆64Updated last year
- Andrej Kapathy's micrograd implemented in c☆30Updated last year
- An implement of deep learning framework and models in C☆48Updated 6 months ago
- ☆40Updated last year