karpathy / transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β106Updated 2 years ago
Related projects β
Alternatives and complementary repositories for transformers
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.β98Updated 6 years ago
- The Tensor (or Array)β408Updated 2 months ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world β¦β602Updated 9 months ago
- root repoβ35Updated last year
- Notebooks and various random funβ1,080Updated last year
- The Multilayer Perceptron Language Modelβ521Updated 3 months ago
- The Autograd Engineβ528Updated last month
- Fast bare-bones BPE for modern tokenizer trainingβ142Updated 2 weeks ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.β706Updated last month
- Code Transformer neural network components piece by pieceβ296Updated last year
- Building Andrej Kapathy's micrograd from scratchβ33Updated last year
- Jupyter Notebook notes on Andrej Karpathy's tutorial series, "Neural Networks: Zero to Hero."β124Updated 3 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementaβ¦β168Updated 3 weeks ago
- nice and effective super simple calorie counter web appβ92Updated 5 months ago
- A numeric optimization package for Torch.β12Updated 3 years ago
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried β¦β99Updated 6 months ago
- β100Updated 2 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in Cβ120Updated 4 months ago
- UNet diffusion model in pure CUDAβ567Updated 4 months ago
- Solve puzzles to improve your tinygrad skills!β87Updated last month
- Implementation of Diffusion Transformer (DiT) in JAXβ252Updated 4 months ago
- Persistent dict, backed by sqlite3 and pickle, multithread-safe.β11Updated 4 years ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfishβ167Updated 3 months ago
- Convoluting Ξ·-dimensional tensors over abstract manifolds.β54Updated this week
- learningggggggg π³β120Updated last month
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelfβ197Updated 7 months ago
- Machine Learning Q and AI bookβ341Updated last month
- System 2 Reasoning Link Collectionβ683Updated last week
- An ML Systems Onboarding listβ539Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ194Updated 6 months ago