hkproj / pytorch-transformer
Attention is all you need implementation
☆790Updated 8 months ago
Alternatives and similar repositories for pytorch-transformer:
Users that are interested in pytorch-transformer are comparing it to the libraries listed below
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆247Updated last year
- Code Transformer neural network components piece by piece☆330Updated last year
- LLaMA 2 implemented from scratch in PyTorch☆292Updated last year
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆398Updated 2 months ago
- Stable Diffusion implemented from scratch in PyTorch☆733Updated 3 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆94Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆162Updated 8 months ago
- ☆125Updated last month
- ☆322Updated this week
- Tutorial for how to build BERT from scratch☆87Updated 8 months ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆235Updated 11 months ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆235Updated 9 months ago
- The "LLM Projects Archive" is a centralized GitHub repository, offering a diverse collection of Language Model Models projects. A valuabl…☆50Updated 2 weeks ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆141Updated 9 months ago
- This repository contains my solutions to the assignments for Stanford's CS224n "Natural Language Processing with Deep Learning" (Winter 2…☆131Updated last year
- ☆865Updated last month
- Assignment solutions for CS224N: Natural Language Processing with Deep Learning - Stanford / Winter 2023☆20Updated last year
- A 4-hour coding workshop to understand how LLMs are implemented and used☆865Updated last month
- Leetcode for Pytorch☆282Updated 2 weeks ago
- nanoGPT style version of Llama 3.1☆1,311Updated 6 months ago
- ☆78Updated 10 months ago
- Material for gpu-mode lectures☆3,691Updated this week
- Transformer: PyTorch Implementation of "Attention Is All You Need"☆3,332Updated 6 months ago
- Notes and commented code for RLHF (PPO)☆69Updated 11 months ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆152Updated last year
- ☆701Updated 8 months ago
- Notes on quantization in neural networks☆68Updated last year