Following Karpathy's GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
☆171 · Jul 31, 2024 · Updated last year
Alternatives and similar repositories for GPT-2
Users that are interested in GPT-2 are comparing it to the libraries listed below.
- ☆15 · Jan 26, 2025 · Updated last year
- Rust Implementation of micrograd ☆52 · Jul 3, 2024 · Updated last year
- From the Tensor to Stable Diffusion, a rough outline for a 10 week course. ☆1,078 · Apr 5, 2026 · Updated last month
- High Quality Resources on GPU Programming/Architecture ☆592 · Jul 26, 2024 · Updated last year
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :) ☆26 · Nov 11, 2024 · Updated last year
- could we make an ml stack in 100,000 lines of code? ☆46 · Jul 17, 2024 · Updated last year
- i will automate factorio ☆114 · Jul 31, 2024 · Updated last year
- Neural Networks from scratch in Go. ☆21 · Jul 7, 2024 · Updated last year
- Tensor library with autograd using only Rust's standard library ☆71 · Jul 1, 2024 · Updated last year
- Simple Transformer in Jax ☆143 · Jun 22, 2024 · Updated last year
- Genome analysis toolkit ☆12 · Apr 23, 2025 · Updated last year
- Retrieve the source code for any model made available on replicate.com! ☆36 · Jan 22, 2024 · Updated 2 years ago
- learningggggggg 🐳 ☆618 · Apr 2, 2025 · Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 · Nov 4, 2024 · Updated last year
- A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️ ☆38 · Jul 10, 2024 · Updated last year
- My code/notebooks following Karpathy's legendary deep learning course: https://www.youtube.com/@AndrejKarpathy ☆23 · Jul 6, 2024 · Updated last year
- ☆16 · Feb 18, 2024 · Updated 2 years ago
- Best-of-N LLM editing with auto version control (+ other unix tools) ☆39 · Apr 22, 2025 · Updated last year
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa… ☆227 · Jan 2, 2025 · Updated last year
- ☆24 · Dec 26, 2023 · Updated 2 years ago
- Assignments of courses taught at IISC as part of MTech AI curriculum ☆143 · Feb 15, 2025 · Updated last year
- Recreating gpt-2 from scratch ☆26 · Jul 6, 2024 · Updated last year
- Jax like function transformation engine but micro, microjax ☆34 · Oct 25, 2024 · Updated last year
- ☆27 · Jul 9, 2024 · Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated 2 years ago
- NSA Triton Kernels written with GPT5 and Opus 4.1 ☆70 · Aug 12, 2025 · Updated 9 months ago
- gpt-2 from scratch in mlx ☆427 · Jun 12, 2024 · Updated last year
- LLM training in simple, raw C/CUDA ☆15 · Dec 5, 2024 · Updated last year
- Julia workshop for undergrad physicists ☆22 · Mar 18, 2021 · Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆20 · Feb 7, 2023 · Updated 3 years ago
- Entropy Based Sampling and Parallel CoT Decoding ☆3,432 · Nov 13, 2024 · Updated last year
- Just a bunch of benchmark logs for different LLMs ☆125 · Jul 28, 2024 · Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆111 · Mar 7, 2025 · Updated last year
- ☆20 · Oct 25, 2025 · Updated 6 months ago
- A deep-dive on the entire history of deep-learning ☆1,553 · Jul 16, 2024 · Updated last year
- Educational WIP ☆70 · Feb 16, 2026 · Updated 3 months ago
- llama3 implementation one matrix multiplication at a time ☆15,236 · May 23, 2024 · Updated last year
- SAC + CPL training humanoids to play piano ☆13 · Mar 30, 2025 · Updated last year
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation ☆14 · Jan 2, 2026 · Updated 4 months ago