zimmerrol / attention-is-all-you-need-kerasLinks

Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need"

☆28

Alternatives and similar repositories for attention-is-all-you-need-keras

Users that are interested in attention-is-all-you-need-keras are comparing it to the libraries listed below

Sorting:

lzfelix / keras_attention
An Attention Layer in Keras
☆43Updated 6 years ago
zimmerrol / keras-utility-layer-collection
Collection of custom layers and utility functions for Keras which are missing in the main framework.
☆62Updated 5 years ago
CyberZHG / keras-pos-embd
Position embedding layers in Keras
☆58Updated 3 years ago
iliaschalkidis / ELMo-keras
Re-implementation of ELMo on Keras
☆135Updated 2 years ago
CyberZHG / keras-layer-normalization
Layer normalization implemented in Keras
☆60Updated 3 years ago
roomylee / self-attentive-emb-tf
Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)
☆91Updated 7 years ago
26hzhang / Dense_BiLSTM
Tensorflow Implementation of Densely Connected Bidirectional LSTM with Applications to Sentence Classification
☆47Updated 7 years ago
dongjun-Lee / transfer-learning-text-tf
Tensorflow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432)
☆82Updated 3 years ago
flrngel / Self-Attentive-tensorflow
Tensorflow implementation of "A Structured Self-Attentive Sentence Embedding"
☆194Updated 4 years ago
tommykwh / TensorFlow-Seq2Seq
Implement en-fr translation task by implenting seq2seq, encoder-decoder in RNN layers with Attention mechanism and Beamsearch inference d…
☆21Updated 7 years ago
prernakhurana2 / Hierarchical-attention-network
My implementation of "Hierarchical Attention Networks for Document Classification" in Keras
☆26Updated 7 years ago
yangperasd / gated_cnn
Keras implementation of “Gated Linear Unit ”
☆23Updated last year
subho406 / Sequence-to-Sequence-and-Attention-from-scratch-using-Tensorflow
Sequence to Sequence and attention from scratch using Tensorflow
☆29Updated 8 years ago
roomylee / rnn-text-classification-tf
Tensorflow Implementation of Recurrent Neural Network (Vanilla, LSTM, GRU) for Text Classification
☆118Updated 7 years ago
taoshen58 / DiSAN
Code of Directional Self-Attention Network (DiSAN)
☆311Updated 7 years ago
Artaches / SSAN-self-attention-sentiment-analysis-classification
Self-Attention: A Better Building Block for Sentiment Analysis Neural Network Classifiers - paper's code for the 9th Workshop on Computat…
☆53Updated 6 years ago
AlexGidiotis / Document-Classifier-LSTM
A bidirectional LSTM with attention for multiclass/multilabel text classification.
☆173Updated last year
CyberZHG / keras-transformer-xl
Transformer-XL with checkpoint loader
☆68Updated 3 years ago
monk1337 / Various-Attention-mechanisms
This repository contain various types of attention mechanism like Bahdanau , Soft attention , Additive Attention , Hierarchical Attention…
☆127Updated 4 years ago
foamliu / Self-Attention-Keras
自注意力与文本分类
☆119Updated 7 years ago
AidenHuen / Capsule-Text-Classification
利用keras搭建的胶囊网络（capsule network文本分类模型，包含RNN、CNN、HAN等，其中keras_utils包含了capsule层和attention层的keras实现
☆78Updated 7 years ago
DongjunLee / transformer-tensorflow
TensorFlow implementation of 'Attention Is All You Need (2017. 6)'
☆349Updated 7 years ago
taoshen58 / BiBloSA
Bi-Directional Block Self-Attention
☆122Updated 7 years ago
jg8610 / multi-task-learning
Multi-Task Learning in NLP
☆94Updated 8 years ago
hongweizeng / cnn-seq2seq
☆38Updated 8 years ago
mmehdig / seq2seq-attention-model
An implementation for attention model in Keras for sequence to sequence model.
☆20Updated 8 years ago
CyberZHG / keras-gpt-2
Load GPT-2 checkpoint and generate texts
☆127Updated 3 years ago
synthesio / hierarchical-attention-networks
Implementation of Hierarchical Attention Networks as presented in https://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf
☆57Updated 7 years ago
kaushalshetty / Structured-Self-Attention
A Structured Self-attentive Sentence Embedding
☆494Updated 6 years ago
stefan-it / capsnet-nlp
CapsNet for NLP
☆66Updated 6 years ago