karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
☆120Updated 2 years ago
Alternatives and similar repositories for transformers:
Users who are interested in transformers are comparing it to the libraries listed below:
- Reproducing Yann LeCun's 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆626Updated last year
- The Multilayer Perceptron Language Model☆542Updated 7 months ago
- The Tensor (or Array)☆425Updated 7 months ago
- The Autograd Engine☆581Updated 6 months ago
- root repo☆36Updated last year
- Simple byte pair encoding (BPE) mechanism used for tokenization, written purely in C☆129Updated 4 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024☆243Updated 10 months ago
- Fast bare-bones BPE for modern tokenizer training☆149Updated 4 months ago
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero."☆149Updated this week
- ☆118Updated last month
- Nice and effective, super simple calorie counter web app☆94Updated 9 months ago
- This repository contains a collection of exploratory notebooks written in pure Python and in language that we humans can read. Have tried …☆107Updated 10 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆268Updated 9 months ago
- Deep Learning Fundamentals -- Code material and exercises☆368Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆96Updated last month
- Persistent dict, backed by sqlite3 and pickle, multithread-safe.☆13Updated 5 years ago
- Code Transformer neural network components piece by piece☆337Updated last year
- The n-gram Language Model☆1,399Updated 7 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆256Updated last year
- Building Andrej Karpathy's micrograd from scratch☆35Updated last year
- ☆119Updated last month
- Torturing neural networks by forcing them to learn the Mandelbrot set.☆148Updated 2 weeks ago
- A C/C++ implementation of micrograd: a tiny autograd engine with a neural net on top.☆65Updated last year
- A repository of paper and architecture replications of classic and SOTA AI/ML papers in PyTorch☆115Updated this week
- The original transformer implemented from scratch, with informative comments on each block☆36Updated 9 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆782Updated 2 weeks ago