hyunwoongko / transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆3,501Updated 7 months ago
Alternatives and similar repositories for transformer:
Users that are interested in transformer are comparing it to the libraries listed below
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,149Updated this week
- An annotated implementation of the Transformer paper.☆6,089Updated 11 months ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,080Updated 11 months ago
- ☆11,074Updated 2 weeks ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆22,144Updated 2 weeks ago
- Google AI 2018 BERT pytorch implementation☆6,335Updated last year
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,383Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆14,498Updated 7 months ago
- Diffusion model papers, survey, and taxonomy☆3,138Updated 3 weeks ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,423Updated last year
- Transformer implementation in PyTorch.☆478Updated 6 years ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆8,497Updated last week
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,922Updated 3 months ago
- Natural Language Processing Tutorial for Deep Learning Researchers☆1,115Updated 3 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆4,804Updated 7 months ago
- Fast and memory-efficient exact attention☆16,370Updated last week
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,074Updated 11 months ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆2,008Updated 2 years ago
- Mamba SSM architecture☆14,291Updated 2 months ago
- The GitHub repository for the paper "Informer" accepted by AAAI 2021.☆5,753Updated 9 months ago
- Reading list for research topics in multimodal machine learning☆6,331Updated 7 months ago
- Code release for ConvNeXt model☆5,918Updated 2 years ago
- Code examples in pyTorch and Tensorflow for CS230☆4,020Updated last year
- Denoising Diffusion Probabilistic Models☆4,214Updated last year
- Pytorch library for fast transformer implementations☆1,686Updated last year
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,464Updated 2 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)☆7,237Updated last year
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,495Updated last year
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆11,235Updated last week
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,649Updated 7 months ago