lilianweng / transformer-tensorflow
Implementation of Transformer Model in Tensorflow
☆442Updated last year
Related projects: ⓘ
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆702Updated 2 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆533Updated 4 years ago
- A repository containing tutorials for practical NLP using PyTorch☆529Updated 5 years ago
- Deep Reinforcement Learning For Sequence to Sequence Models☆767Updated last year
- Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch☆689Updated 3 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)'☆348Updated 6 years ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,177Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,105Updated 2 years ago
- Sequence to Sequence Models with PyTorch☆734Updated 2 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,513Updated 4 years ago
- Sequence-to-Sequence learning using PyTorch☆521Updated 4 years ago
- An open source framework for seq2seq models in PyTorch.☆1,493Updated last year
- Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction☆499Updated 3 years ago
- Keras implementation of BERT with pre-trained weights☆814Updated 5 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆341Updated 7 months ago
- PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349☆584Updated last month
- Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…☆745Updated 2 years ago
- Transformer implemented in Keras☆370Updated 2 years ago
- Transformer training code for sequential tasks☆608Updated 3 years ago
- Hierarchical Attention Networks for Document Classification in PyTorch☆603Updated 4 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆654Updated 2 years ago
- ☆3,601Updated 2 years ago
- Fast BPE☆652Updated 3 months ago
- PyTorch implementation of beam search decoding for seq2seq models☆333Updated last year
- Simple XLNet implementation with Pytorch Wrapper☆576Updated 5 years ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,506Updated 3 years ago
- LSTM and QRNN Language Model Toolkit for PyTorch☆1,959Updated 2 years ago
- Neural Turing Machines (NTM) - PyTorch Implementation☆582Updated 6 years ago
- PyTorch implementation of batched bi-RNN encoder and attention-decoder.☆278Updated 5 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆577Updated 5 years ago