hyunwoongko / transformerLinks
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆4,379Updated 6 months ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- An annotated implementation of the Transformer paper.☆6,940Updated last year
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,599Updated last year
- Transformer implementation in PyTorch.☆491Updated 6 years ago
- Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Tran…☆288Updated last year
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,781Updated 2 weeks ago
- Natural Language Processing Tutorial for Deep Learning Researchers☆1,155Updated 3 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,422Updated 2 years ago
- Mamba SSM architecture☆16,979Updated last week
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆24,907Updated last week
- ☆12,239Updated last week
- Google AI 2018 BERT pytorch implementation☆6,517Updated 2 years ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,218Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,111Updated last week
- Implementing Vi(sion)T(transformer)☆449Updated 2 years ago
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,668Updated 2 years ago
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,155Updated last year
- Attention is all you need implementation☆1,151Updated last year
- ☆4,174Updated 4 years ago
- A Collection of Variational Autoencoders (VAE) in PyTorch.☆7,572Updated 10 months ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,537Updated 2 years ago
- Implementation of Denoising Diffusion Probabilistic Model in Pytorch☆10,395Updated 5 months ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,405Updated last year
- Transformer Encoder PyTorch note☆131Updated 2 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,452Updated 2 years ago
- Denoising Diffusion Probabilistic Models☆4,994Updated 2 years ago
- Reading list for research topics in multimodal machine learning☆6,793Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,665Updated last year
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆8,851Updated this week
- ☆125Updated 7 months ago
- Diffusion model papers, survey, and taxonomy☆3,312Updated 3 months ago