Laz4rz / GPT-2View external linksLinks
Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172Jul 31, 2024Updated last year
Alternatives and similar repositories for GPT-2
Users that are interested in GPT-2 are comparing it to the libraries listed below
Sorting:
- Rust Implementation of micrograd☆53Jul 3, 2024Updated last year
- ☆16Jan 26, 2025Updated last year
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,073Oct 5, 2025Updated 4 months ago
- i will automate factorio☆111Jul 31, 2024Updated last year
- High Quality Resources on GPU Programming/Architecture☆592Jul 26, 2024Updated last year
- ☆27Jul 9, 2024Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- could we make an ml stack in 100,000 lines of code?☆46Jul 17, 2024Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Jan 22, 2024Updated 2 years ago
- G4T0R2 - TEKNOFEST 2024 Türkçe Doğal Dil İşleme - Senaryo Ekibi #Acıkhack2024TDDİ☆10Jan 25, 2025Updated last year
- Simple Transformer in Jax☆142Jun 22, 2024Updated last year
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated last year
- Best-of-N LLM editing with auto version control (+ other unix tools)☆39Apr 22, 2025Updated 9 months ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆63Jul 8, 2024Updated last year
- A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️☆38Jul 10, 2024Updated last year
- ☆16Feb 18, 2024Updated last year
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆229Jan 2, 2025Updated last year
- ☆13Aug 10, 2024Updated last year
- ☆11Feb 13, 2024Updated 2 years ago
- Experimentation on google's gemma model☆16Mar 6, 2024Updated last year
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- ☆17Jul 9, 2025Updated 7 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- gpt-2 from scratch in mlx☆415Jun 12, 2024Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Jul 8, 2024Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- learningggggggg 🐳☆575Apr 2, 2025Updated 10 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Nov 4, 2024Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- ☆14Apr 16, 2025Updated 9 months ago
- ☆15Apr 10, 2024Updated last year
- A tutorial example for nbdev☆15Feb 26, 2022Updated 3 years ago
- A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview☆14Jul 9, 2024Updated last year
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆225Aug 2, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Build visualizations live!