karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆127 · Updated 2 years ago
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
- The Multilayer Perceptron Language Model ☆551 · Updated 10 months ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world … ☆638 · Updated last year
- The Tensor (or Array) ☆436 · Updated 10 months ago
- The Autograd Engine ☆616 · Updated 9 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆264 · Updated this week
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried … ☆112 · Updated last year
- root repo ☆37 · Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆271 · Updated 7 months ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc. ☆106 · Updated 7 years ago
- nice and effective super simple calorie counter web app ☆98 · Updated last year
- Notebooks and various random fun ☆1,095 · Updated 2 years ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆801 · Updated last week
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero." ☆168 · Updated this week
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa… ☆218 · Updated 5 months ago
- A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top. ☆69 · Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆173 · Updated 10 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆278 · Updated last year
- I will build Transformer from scratch ☆70 · Updated last year
- learningggggggg ☆526 · Updated 2 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024 ☆242 · Updated last year
- Deep Learning Fundamentals -- Code material and exercises ☆379 · Updated last year
- Alex Krizhevsky's original code from Google Code ☆192 · Updated 9 years ago
- UNet diffusion model in pure CUDA ☆608 · Updated 11 months ago
- nanoGPT style version of Llama 3.1 ☆1,380 · Updated 10 months ago
- The n-gram Language Model ☆1,424 · Updated 10 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 · Updated last year
- Fast bare-bones BPE for modern tokenizer training ☆159 · Updated 2 months ago
- Persistent dict, backed by sqlite3 and pickle, multithread-safe. ☆14 · Updated 5 years ago
- Puzzles for exploring transformers ☆349 · Updated 2 years ago
- This repository contains everything you need to become proficient in ML/AI Research and Research Papers ☆593 · Updated last year