karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆170 · Updated 3 years ago
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
- root repo ☆66 · Updated 2 years ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world … ☆700 · Updated last year
- nice and effective super simple calorie counter web app ☆125 · Updated last year
- A numeric optimization package for Torch. ☆37 · Updated 4 years ago
- Automating research publications discovery and analysis. For example, ever wish your computer could automatically open papers that are mo… ☆321 · Updated 2 years ago
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried … ☆122 · Updated last year
- Notebooks and various random fun ☆1,134 · Updated 2 years ago
- Persistent dict, backed by sqlite3 and pickle, multithread-safe. ☆34 · Updated 5 years ago
- ☆126 · Updated 10 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA) ☆331 · Updated 2 years ago
- Code Transformer neural network components piece by piece ☆370 · Updated 2 years ago
- Game making library for using Canvas element ☆90 · Updated 2 years ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆277 · Updated last year
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero." ☆198 · Updated last week
- Language model alignment-focused deep learning curriculum ☆1,502 · Updated last year
- An autoregressive character-level language model for making more things ☆3,508 · Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆395 · Updated last month
- Frontier Models playing the board game Diplomacy. ☆607 · Updated last month
- Llama from scratch, or How to implement a paper without crying ☆581 · Updated last year
- Simple Byte Pair Encoding mechanism used for the tokenization process, written purely in C ☆139 · Updated last year
- This repo has all the basic things you'll need in order to understand complete vision transformer architecture and its various implementa… ☆228 · Updated 11 months ago
- I will build Transformer from scratch ☆87 · Updated 5 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024 ☆248 · Updated last year
- Learnings and programs related to CUDA ☆428 · Updated 5 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆828 · Updated 4 months ago
- Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes ☆1,854 · Updated 4 years ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆117 · Updated 2 years ago
- It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning. ☆1,001 · Updated last year
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors … ☆1,425 · Updated 2 years ago
- UNet diffusion model in pure CUDA ☆656 · Updated last year