hyunwoongko / transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆3,692Updated 9 months ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,177Updated last year
- An annotated implementation of the Transformer paper.☆6,220Updated last year
- Google AI 2018 BERT pytorch implementation☆6,391Updated last year
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆22,812Updated 2 months ago
- ☆11,336Updated 2 months ago
- A Collection of Variational Autoencoders (VAE) in PyTorch.☆7,135Updated last month
- Natural Language Processing Tutorial for Deep Learning Researchers☆1,126Updated 3 years ago
- Code examples in pyTorch and Tensorflow for CS230☆4,038Updated 2 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,395Updated last year
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,298Updated 3 weeks ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,973Updated 2 weeks ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,755Updated 9 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,389Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆34,105Updated this week
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,364Updated last year
- Diffusion model papers, survey, and taxonomy☆3,182Updated 2 months ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,451Updated last year
- Code release for ConvNeXt model☆5,996Updated 2 years ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆8,708Updated this week
- Natural Language Processing Tutorial for Deep Learning Researchers☆14,591Updated last year
- Code for ALBEF: a new vision-language pre-training method☆1,650Updated 2 years ago
- Fast and memory-efficient exact attention☆17,346Updated last week
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)☆6,661Updated this week
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆11,921Updated 5 months ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,030Updated 2 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,786Updated 9 months ago
- Mamba SSM architecture☆14,868Updated last week
- Official DeiT repository☆4,194Updated last year
- Denoising Diffusion Probabilistic Models☆4,390Updated last year
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"☆556Updated 4 years ago