karpathy / transformersLinks
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β173Updated 3 years ago
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
Sorting:
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.β127Updated 7 years ago
- root repoβ67Updated 2 years ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world β¦β705Updated last year
- nice and effective super simple calorie counter web appβ126Updated last year
- A numeric optimization package for Torch.β37Updated 4 years ago
- Notebooks and various random funβ1,139Updated 2 years ago
- Game making library for using Canvas elementβ91Updated 2 years ago
- Persistent dict, backed by sqlite3 and pickle, multithread-safe.β35Updated 5 years ago
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried β¦β123Updated last year
- β128Updated last year
- An autoregressive character-level language model for making more thingsβ3,605Updated last year
- Code Transformer neural network components piece by pieceβ371Updated 2 years ago
- Frontier Models playing the board game Diplomacy.β624Updated last month
- small auto-grad engine inspired from Karpathy's micrograd and PyTorchβ276Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorchβ400Updated 2 months ago
- I will build Transformer from scratchβ84Updated 6 months ago
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero."β202Updated 2 weeks ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)β334Updated 2 years ago
- This repository contains an exhaustive coverage of a hands on approach to PyTorch along side powerful tools to accelerate model tuning anβ¦β226Updated last month
- β34Updated 9 years ago
- Artificial Life simulator using canvas. Based on https://github.com/karpathy/scriptsbotsβ102Updated 13 years ago
- Building Andrej Kapathy's micrograd from scratchβ46Updated 2 years ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in Cβ144Updated last year
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposesβ1,857Updated 4 years ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.β829Updated 6 months ago
- Fast bare-bones BPE for modern tokenizer trainingβ174Updated 7 months ago
- UNet diffusion model in pure CUDAβ662Updated last year
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfishβ172Updated last year
- Some helpers and examples for creating an LLM fine-tuning datasetβ74Updated last year
- Learnings and programs related to CUDAβ431Updated 7 months ago