CyberZHG / keras-adaptive-softmax
Adaptive embedding and softmax
☆17Updated 3 years ago
Alternatives and similar repositories for keras-adaptive-softmax:
Users that are interested in keras-adaptive-softmax are comparing it to the libraries listed below
- Quasi-RNN for language modeling☆57Updated 8 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- Implementation in pytorch of SR-NMT https://arxiv.org/abs/1805.04185v1☆25Updated 6 years ago
- ☆26Updated 6 years ago
- ☆42Updated 6 years ago
- ☆14Updated 5 years ago
- A Keras implementation of Adaptive Softmax☆7Updated 6 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 5 years ago
- a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)☆53Updated 2 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Updated 2 years ago
- BERT Extension in TensorFlow☆30Updated 5 years ago
- Highway networks implemented in PyTorch.☆56Updated 7 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- Keras implementation of “Gated Linear Unit ”☆23Updated 9 months ago
- This repo is for residual-connected sentence encoder for NLI.☆11Updated 7 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- Interpretable Models for NLP using PyTorch☆18Updated 7 years ago
- Code for "Smaller Text Classifiers with Discriminative Cluster Embeddings" (NAACL 2018)☆29Updated 6 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆77Updated 2 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Updated 7 years ago
- fairseq: Convolutional Sequence to Sequence Learning (Gehring et al. 2017) by Chainer☆64Updated 7 years ago
- Knowledge Distillation For Transformer Language Models☆52Updated last year
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- A sentence encoding-based model for natural language inference☆31Updated 6 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆46Updated 7 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆61Updated 6 years ago
- Code and dataset for "Transfer Learning Between Related Tasks Using Expected Label Proportions"☆16Updated 5 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Updated 7 years ago
- Sub-Character Representation Learning☆25Updated 6 years ago