lilianweng / transformer-tensorflow
Implementation of Transformer Model in Tensorflow
☆467 Updated 2 years ago
Alternatives and similar repositories for transformer-tensorflow:
Users who are interested in transformer-tensorflow are comparing it to the libraries listed below.
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need ☆712 Updated 3 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp. ☆657 Updated 3 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models ☆537 Updated 4 years ago
- A wrapper layer for stacking layers horizontally ☆228 Updated 3 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)' ☆348 Updated 6 years ago
- Transformer implemented in Keras ☆372 Updated 3 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch ☆1,112 Updated 2 years ago
- Sequence to Sequence Models with PyTorch ☆738 Updated 3 years ago
- Transformer training code for sequential tasks ☆610 Updated 3 years ago
- Keras implementation of BERT with pre-trained weights ☆814 Updated 5 years ago
- Long Range Arena for Benchmarking Efficient Transformers ☆748 Updated last year
- LSTM and QRNN Language Model Toolkit for PyTorch ☆1,969 Updated 3 years ago
- A repository containing tutorials for practical NLP using PyTorch ☆535 Updated 5 years ago
- Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction ☆502 Updated 3 years ago
- Implementation of Universal Transformer in Pytorch ☆259 Updated 6 years ago
- Visualizing RNNs using the attention mechanism ☆749 Updated 5 years ago
- An open source framework for seq2seq models in PyTorch. ☆1,510 Updated 2 years ago
- ☆3,640 Updated 2 years ago
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg) ☆1,081 Updated last week
- Dynamic seq2seq in TensorFlow, step by step ☆996 Updated 7 years ago
- Pytorch Implementation of Google BERT ☆594 Updated 5 years ago
- Single Headed Attention RNN - "Stop thinking with your head" ☆1,181 Updated 3 years ago
- Deep Reinforcement Learning For Sequence to Sequence Models ☆767 Updated 2 years ago
- Transformers for Longer Sequences ☆598 Updated 2 years ago
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation" ☆554 Updated 4 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks" ☆579 Updated 5 years ago
- PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349 ☆591 Updated 2 months ago
- Hierarchical Attention Networks for Document Classification in PyTorch ☆604 Updated 5 years ago
- 🐥 A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI ☆1,507 Updated 3 years ago
- PyTorch implementation of beam search decoding for seq2seq models ☆337 Updated 2 years ago