tranquoctrinh / transformerLinks
This is a PyTorch implementation of the Transformer model in the paper Attention is All You Need
☆36Updated 10 months ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆282Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆235Updated last year
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆362Updated last year
- ☆82Updated last year
- This repo implements Denoising Diffusion Probabilistic Models (DDPM) in Pytorch☆168Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆122Updated 2 years ago
- Personal short implementations of Machine Learning papers☆252Updated 2 years ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆402Updated last year
- KAN for Vision Transformer☆256Updated last year
- ☆64Updated 3 years ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆181Updated 6 months ago
- Code Transformer neural network components piece by piece☆372Updated 2 years ago
- Attention is all you need implementation☆1,164Updated last year
- The best collection of AI tutorials to make you a boss of Data Science!☆116Updated 6 months ago
- Instructions for connecting SSH between Kaggle and Visual Studio Code☆82Updated 3 months ago
- ☆13Updated last year
- List of ML conferences with important dates and accepted paper list☆208Updated last month
- PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, Mo…☆55Updated 3 years ago
- A Great Collection of Deep Learning Tutorials and Repositories☆358Updated last month
- ☆140Updated last year
- A clean PyTorch implementation of the original Transformer model + A German -> English translation example☆38Updated 4 years ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆189Updated last year
- A simple tutorial of Diffusion Probabilistic Models☆108Updated last year
- ☆62Updated last year
- This is the repository for the course Natural Language Processing at Asian Institute of Technology. Covers word vectors, spaCy, PyTorch…☆87Updated 9 months ago
- optimizer & lr scheduler & loss function collections in PyTorch☆387Updated this week
- Variations of Kolmogorov-Arnold Networks☆116Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆206Updated 3 weeks ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆179Updated 2 years ago
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆181Updated last year