Antimatter543 / karpathy-NN-lecturesLinks
My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and learnt ML techniques.
☆10Updated 2 years ago
Alternatives and similar repositories for karpathy-NN-lectures
Users that are interested in karpathy-NN-lectures are comparing it to the libraries listed below
Sorting:
- Simple Transformer in Jax☆142Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆23Updated last year
- A really tiny autograd engine☆99Updated 8 months ago
- ☆96Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- A visual interface for understanding and interpreting Transformers☆77Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31Updated 2 years ago
- An introduction to LLM Sampling☆79Updated last year
- A puzzle to learn about prompting☆135Updated 2 years ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- This repository contain the simple llama3 implementation in pure jax.☆71Updated 11 months ago
- ☆41Updated 2 years ago
- ☆45Updated 7 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆117Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- smolbox of recipies☆29Updated 9 months ago
- ☆29Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- Like picoGPT but for BERT.☆51Updated 2 years ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated 2 years ago
- Run, build, test transformer models using docker☆32Updated 2 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Updated 2 years ago
- Sparse autoencoders for Contra text embedding models☆25Updated last year
- ☆22Updated 2 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Updated 10 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- ☆13Updated 2 years ago
- Tensor library with autograd using only Rust's standard library☆71Updated last year