Stonesjtu / Pytorch-NCELinks
The Noise Contrastive Estimation for softmax output written in Pytorch
☆319Updated 5 years ago
Alternatives and similar repositories for Pytorch-NCE
Users that are interested in Pytorch-NCE are comparing it to the libraries listed below
Sorting:
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆123Updated 5 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆443Updated last year
- Implementation of Sparsemax activation in Pytorch☆161Updated 5 years ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- Implementation of Universal Transformer in Pytorch☆261Updated 6 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆148Updated 6 years ago
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm)☆137Updated 4 years ago
- Codes for "Towards Binary-Valued Gates for Robust LSTM Training".☆75Updated 7 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆251Updated 3 years ago
- pytorch neural network attention mechanism☆147Updated 6 years ago
- A PyTorch implementation of : Language Modeling with Gated Convolutional Networks.☆99Updated 3 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆271Updated 3 years ago
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆230Updated 6 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆128Updated 4 years ago
- Minimal RNN classifier with self-attention in Pytorch☆150Updated 3 years ago
- Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"☆169Updated 6 years ago
- ☆83Updated 5 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆376Updated 4 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆245Updated 5 years ago
- Implementation of Dual Learning NMT on PyTorch☆163Updated 7 years ago
- A pytorch implementation of the paper: "Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks"☆81Updated 6 years ago
- Two-Layer Hierarchical Softmax Implementation for PyTorch☆69Updated 4 years ago
- Code for "Understanding and Improving Layer Normalization"☆46Updated 5 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆580Updated 5 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆245Updated 6 years ago
- Sparse and structured neural attention mechanisms☆224Updated 4 years ago
- Code for Sluice networks: Learning what to share between loosely related tasks☆154Updated 6 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆410Updated last year
- ☆218Updated 5 years ago
- NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting☆109Updated 4 years ago