hkproj / pytorch-transformerLinks
Attention is all you need implementation
☆931Updated 11 months ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below
Sorting:
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆279Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch☆328Updated last year
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆484Updated 5 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆104Updated last year
- Code Transformer neural network components piece by piece☆349Updated 2 years ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆247Updated last year
- ☆408Updated this week
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆186Updated 11 months ago
- Tutorial for how to build BERT from scratch☆93Updated last year
- Contains the public resources of Hands on GenAI book☆152Updated 4 months ago
- Distributed training (multi-node) of a Transformer model☆68Updated last year
- Beginner Level Deep Learning Tutorials in Pytorch with Youtube Videos!☆356Updated 6 months ago
- Notes and commented code for RLHF (PPO)☆94Updated last year
- Leetcode for Pytorch☆450Updated last week
- ☆83Updated last year
- Transformer: PyTorch Implementation of "Attention Is All You Need"☆3,752Updated 9 months ago
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆313Updated last year
- ☆125Updated 11 months ago
- ☆168Updated 5 months ago
- Deep Learning Fundamentals -- Code material and exercises☆376Updated last year
- ☆1,199Updated 3 months ago
- 100 days of building GPU kernels!☆430Updated last month
- In Generative AI with Large Language Models (LLMs), you’ll learn the fundamentals of how generative AI works, and how to deploy it in rea…☆550Updated 10 months ago
- Machine Learning Q and AI book☆426Updated 8 months ago
- A 4-hour coding workshop to understand how LLMs are implemented and used☆947Updated 4 months ago
- Notes about LLaMA 2 model☆59Updated last year
- LLM (Large Language Model) FineTuning☆538Updated 2 months ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆250Updated last year
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face …☆648Updated last week
- GPU Kernels☆178Updated last month