yangsaiyong / tf-adaptive-softmax-lstm-lm
Experiment results for LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax in TensorFlow.
☆100 Updated 6 years ago
Alternatives and similar repositories for tf-adaptive-softmax-lstm-lm
Users who are interested in tf-adaptive-softmax-lstm-lm compare it to the repositories listed below.
- Attention-based NMT with a coverage mechanism to indicate whether a source word is translated or not ☆111 Updated 5 years ago
- Implementation of Attention-over-Attention Neural Networks for Reading Comprehension (https://arxiv.org/abs/1607.04423) in TensorFlow ☆177 Updated 8 years ago
- Source code for "Accelerating Neural Transformer via an Average Attention Network" ☆78 Updated 5 years ago
- Tensorflow implementation of "Language Modeling with Gated Convolutional Networks" ☆271 Updated 8 years ago
- Quasi-RNN for language modeling ☆57 Updated 8 years ago
- ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager ☆74 Updated 7 years ago
- An attempt to implement the TreeLSTM in Theano ☆44 Updated 9 years ago
- Dynamic evaluation for pytorch language models, now includes hyperparameter tuning ☆104 Updated 7 years ago
- ☆42 Updated 6 years ago
- fairseq: Convolutional Sequence to Sequence Learning (Gehring et al. 2017) by Chainer ☆67 Updated 7 years ago
- Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017) ☆91 Updated 6 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset ☆123 Updated 5 years ago
- ☆149 Updated 2 years ago
- Bi-Directional Block Self-Attention ☆122 Updated 7 years ago
- An implementation of RNNsearch using TensorFlow ☆67 Updated 7 years ago
- Intent parsing and slot filling in PyTorch with seq2seq + attention ☆159 Updated 7 years ago
- An implementation of the Globally Normalized Reader ☆58 Updated 2 years ago
- Decomposable Attention Model for Sentence Pair Classification (from https://arxiv.org/abs/1606.01933) ☆95 Updated 8 years ago
- PyTorch, LSTM CRF model ☆9 Updated 7 years ago
- BiLSTM-CRF for sequence labeling in Dynet ☆80 Updated 7 years ago
- ☆38 Updated 8 years ago
- Hierarchical Recurrent Encoder Decoder for Query Suggestion ☆108 Updated 8 years ago
- Tensorflow implementation of "A Structured Self-Attentive Sentence Embedding" ☆193 Updated 3 years ago
- Language Modeling ☆156 Updated 5 years ago
- ☆74 Updated 8 years ago
- Tensorflow implementation of Dynamic Coattention Networks for Question Answering. ☆100 Updated 8 years ago
- Implementation in Tensorflow of A Structured Self-attentive Sentence Embedding with the sentiment analysis task. ☆66 Updated 7 years ago
- Character-Aware Neural Language Models. A keras-based implementation ☆118 Updated 3 years ago
- Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling ☆146 Updated 5 years ago
- ☆53 Updated 8 years ago