tranquoctrinh / transformerLinks
This is a PyTorch implementation of the Transformer model in the paper Attention is All You Need
☆32Updated 6 months ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆117Updated 2 years ago
- Personal short implementations of Machine Learning papers☆250Updated last year
- A clean PyTorch implementation of the original Transformer model + A German -> English translation example☆37Updated 3 years ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆214Updated last year
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆350Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning☆42Updated last year
- Attention is all you need implementation☆1,051Updated last year
- Code Transformer neural network components piece by piece☆364Updated 2 years ago
- This repo implements Denoising Diffusion Probabilistic Models (DDPM) in Pytorch☆148Updated 10 months ago
- The best collection of AI tutorials to make you a boss of Data Science!☆102Updated 3 months ago
- LLaMA 2 implemented from scratch in PyTorch☆355Updated 2 years ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆264Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆188Updated 10 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆177Updated 2 months ago
- a way to SSH into Kaggle!☆73Updated 2 weeks ago
- A numpy implementation of the Transformer model in "Attention is All You Need"☆58Updated last year
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆172Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆124Updated last year
- List of ML conferences with important dates and accepted paper list☆160Updated last week
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆391Updated last year
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆265Updated last year
- Variations of Kolmogorov-Arnold Networks☆115Updated last year
- Natural Language Processing Courses with Resources☆40Updated 2 weeks ago
- Fine-tuning Vision Transformers on various classification datasets☆109Updated last year
- Notes on quantization in neural networks☆104Updated last year
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆316Updated 2 years ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆119Updated last year
- ☆209Updated 9 months ago
- Stable Diffusion implemented from scratch in PyTorch☆981Updated 11 months ago
- A Great Collection of Deep Learning Tutorials and Repositories☆317Updated this week