karpathy / transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β121Updated 2 years ago
Alternatives and similar repositories for transformers:
Users that are interested in transformers are comparing it to the libraries listed below
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world β¦β631Updated last year
- root repoβ36Updated last year
- The Tensor (or Array)β429Updated 8 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.β784Updated last month
- The Multilayer Perceptron Language Modelβ544Updated 8 months ago
- The Autograd Engineβ596Updated 7 months ago
- Implementation of Diffusion Transformer (DiT) in JAXβ270Updated 10 months ago
- β232Updated last week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorchβ158Updated last week
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementaβ¦β214Updated 3 months ago
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried β¦β108Updated 11 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024β244Updated 11 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorchβ251Updated 4 months ago
- High Quality Resources on GPU Programming/Architectureβ585Updated 8 months ago
- Deep Learning Fundamentals -- Code material and exercisesβ369Updated last year
- UNet diffusion model in pure CUDAβ601Updated 9 months ago
- β103Updated 7 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculumβ61Updated 4 months ago
- GPU Kernelsβ160Updated last week
- learningggggggg π³β499Updated 2 weeks ago
- Notebooks and various random funβ1,094Updated 2 years ago
- Machine Learning Q and AI bookβ407Updated 6 months ago
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero."β160Updated last week
- β118Updated 2 months ago
- 100 days of building GPU kernels!β336Updated this week
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)β266Updated last year
- nice and effective super simple calorie counter web appβ94Updated 10 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a modelβ141Updated 9 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastβ¦β315Updated last month
- Code Transformer neural network components piece by pieceβ338Updated last year