tranquoctrinh / transformerLinks
This is a PyTorch implementation of the Transformer model in the paper Attention is All You Need
☆34Updated 8 months ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- Personal short implementations of Machine Learning papers☆248Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆272Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆224Updated last year
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆359Updated last year
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆180Updated 4 months ago
- LLaMA 2 implemented from scratch in PyTorch☆363Updated 2 years ago
- This repo implements Denoising Diffusion Probabilistic Models (DDPM) in Pytorch☆156Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆117Updated 2 years ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆186Updated last year
- Code Transformer neural network components piece by piece☆370Updated 2 years ago
- Collection of tutorials on diffusion models, step-by-step implementation guide, scripts for generating images with AI, prompt engineering…☆150Updated 9 months ago
- ☆223Updated 11 months ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆128Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆200Updated last month
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆397Updated last year
- KAN for Vision Transformer☆255Updated last year
- ☆81Updated last year
- Notes on quantization in neural networks☆113Updated 2 years ago
- A numpy implementation of the Transformer model in "Attention is All You Need"☆58Updated last year
- Attention is all you need implementation☆1,118Updated last year
- Natural Language Processing Courses with Resources☆40Updated 2 months ago
- several types of attention modules written in PyTorch for learning purposes☆52Updated last year
- PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …☆83Updated last year
- 100 days of building GPU kernels!☆547Updated 7 months ago
- A clean PyTorch implementation of the original Transformer model + A German -> English translation example☆37Updated 3 years ago
- Variations of Kolmogorov-Arnold Networks☆115Updated last year
- Reproduction of DeepSeek-R1☆244Updated 8 months ago
- Note and Assignments for CS231n: Convolutional Neural Networks for Visual Recognition☆46Updated 4 years ago
- Simple Adaptation of BitNet☆32Updated last year
- A simple tutorial of Diffusion Probabilistic Models☆107Updated last year