hkproj / pytorch-transformer
Attention is all you need implementation
☆911Updated 11 months ago
Alternatives and similar repositories for pytorch-transformer:
Users that are interested in pytorch-transformer are comparing it to the libraries listed below
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆273Updated last year
- LLaMA 2 implemented from scratch in PyTorch☆323Updated last year
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆461Updated 5 months ago
- Code Transformer neural network components piece by piece☆343Updated 2 years ago
- Stable Diffusion implemented from scratch in PyTorch☆850Updated 6 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆102Updated last year
- 100 days of building GPU kernels!☆399Updated last week
- ☆159Updated 4 months ago
- GPU Kernels☆172Updated last week
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆182Updated 10 months ago
- I will build Transformer from scratch☆68Updated 11 months ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆244Updated last year
- Notes on quantization in neural networks☆81Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆214Updated last year
- Transformers 3rd Edition☆412Updated last week
- ☆738Updated 10 months ago
- ☆1,083Updated 3 weeks ago
- Tutorial for how to build BERT from scratch☆92Updated 11 months ago
- Personal short implementations of Machine Learning papers☆250Updated last year
- Beginner Level Deep Learning Tutorials in Pytorch with Youtube Videos!☆343Updated 5 months ago
- Transformer: PyTorch Implementation of "Attention Is All You Need"☆3,679Updated 9 months ago
- LLM (Large Language Model) FineTuning☆533Updated last month
- Shortest solutions for CS231n 2021-2025☆327Updated this week
- ☆1,172Updated 2 months ago
- ☆120Updated 10 months ago
- Notes about LLaMA 2 model☆59Updated last year
- Official repository of my book: "Deep Learning with PyTorch Step-by-Step: A Beginner's Guide"☆1,066Updated 2 months ago
- The Multilayer Perceptron Language Model☆547Updated 9 months ago
- Notes and commented code for RLHF (PPO)☆90Updated last year
- ☆82Updated last year