Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172Jul 31, 2024Updated last year
Alternatives and similar repositories for GPT-2
Users that are interested in GPT-2 are comparing it to the libraries listed below
Sorting:
- Rust Implementation of micrograd☆52Jul 3, 2024Updated last year
- ☆16Jan 26, 2025Updated last year
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,074Oct 5, 2025Updated 5 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- Tensor library with autograd using only Rust's standard library☆71Jul 1, 2024Updated last year
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆26Nov 11, 2024Updated last year
- could we make an ml stack in 100,000 lines of code?☆46Jul 17, 2024Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Jan 22, 2024Updated 2 years ago
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated last year
- Machine learning based credit risk prediction system☆27Nov 18, 2025Updated 3 months ago
- Best-of-N LLM editing with auto version control (+ other unix tools)☆39Apr 22, 2025Updated 10 months ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆63Jul 8, 2024Updated last year
- A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️☆38Jul 10, 2024Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆84May 22, 2025Updated 9 months ago
- ☆16Feb 18, 2024Updated 2 years ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆229Jan 2, 2025Updated last year
- ☆11Feb 13, 2024Updated 2 years ago
- Experimentation on google's gemma model☆16Mar 6, 2024Updated 2 years ago
- ☆15Feb 24, 2026Updated 2 weeks ago
- ☆13Aug 10, 2024Updated last year
- ☆11May 18, 2025Updated 9 months ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- ☆17Jul 9, 2025Updated 8 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- gpt-2 from scratch in mlx☆417Jun 12, 2024Updated last year
- learningggggggg 🐳☆576Apr 2, 2025Updated 11 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- ☆14Apr 16, 2025Updated 10 months ago
- ☆15Apr 10, 2024Updated last year
- NSA Triton Kernels written with GPT5 and Opus 4.1☆70Aug 12, 2025Updated 6 months ago
- A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview☆14Jul 9, 2024Updated last year
- Automatically annotates YOLO dataset using Moondream visual model☆20Aug 24, 2025Updated 6 months ago
- ☆16Jul 26, 2023Updated 2 years ago
- A tutorial example for nbdev☆15Feb 26, 2022Updated 4 years ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆226Aug 2, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- A standalone implementation of the DOM Events APIs, extracted from the Node.js codebase, for usage in WinterCG runtimes.☆18Mar 20, 2024Updated last year
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year