Stonesjtu / Pytorch-NCELinks
The Noise Contrastive Estimation for softmax output written in Pytorch
☆319Updated 5 years ago
Alternatives and similar repositories for Pytorch-NCE
Users that are interested in Pytorch-NCE are comparing it to the libraries listed below
Sorting:
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆123Updated 6 years ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- Implementation of Sparsemax activation in Pytorch☆163Updated 5 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆447Updated last year
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm)☆137Updated 4 years ago
- A PyTorch implementation of : Language Modeling with Gated Convolutional Networks.☆102Updated 3 years ago
- Implementation of Universal Transformer in Pytorch☆263Updated 6 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"☆271Updated 3 years ago
- Codes for "Towards Binary-Valued Gates for Robust LSTM Training".☆74Updated 7 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆252Updated 3 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆580Updated 6 years ago
- pytorch neural network attention mechanism☆147Updated 6 years ago
- Minimal RNN classifier with self-attention in Pytorch☆151Updated 3 years ago
- A pytorch implementation of the paper: "Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks"☆83Updated 7 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆148Updated 6 years ago
- Two-Layer Hierarchical Softmax Implementation for PyTorch☆70Updated 4 years ago
- Code for Sluice networks: Learning what to share between loosely related tasks☆154Updated 6 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆246Updated 5 years ago
- Sampled Softmax Implementation for PyTorch☆44Updated 7 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆121Updated 5 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆415Updated last year
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆230Updated 6 years ago
- Code for "Understanding and Improving Layer Normalization"☆46Updated 5 years ago
- pytorch implementation of Attention is all you need☆239Updated 4 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆247Updated 6 years ago
- Implementation of Dual Learning NMT on PyTorch☆163Updated 7 years ago
- ☆315Updated 3 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆128Updated 4 years ago
- ☆93Updated 4 years ago
- Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"☆170Updated 6 years ago