Stonesjtu / Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
☆318Updated 5 years ago
Alternatives and similar repositories for Pytorch-NCE:
Users that are interested in Pytorch-NCE are comparing it to the libraries listed below
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆123Updated 5 years ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- categorical variational autoencoder using the Gumbel-Softmax estimator☆429Updated 7 years ago
- PyTorch Implementation of the paper Learning to Reweight Examples for Robust Deep Learning☆354Updated 5 years ago
- Implementation of Sparsemax activation in Pytorch☆159Updated 4 years ago
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm)☆136Updated 3 years ago
- A PyTorch implementation of : Language Modeling with Gated Convolutional Networks.☆99Updated 3 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆426Updated 8 months ago
- Code for paper "Learning to Reweight Examples for Robust Deep Learning"☆270Updated 5 years ago
- pytorch implementation of VAE-Gumble-Softmax☆62Updated 4 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆250Updated 3 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"☆578Updated 5 years ago
- Virtual Adversarial Training (VAT) implementation for PyTorch☆296Updated 6 years ago
- PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution☆207Updated 6 years ago
- Minimal RNN classifier with self-attention in Pytorch☆150Updated 3 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆245Updated 6 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆314Updated 4 years ago
- pytorch neural network attention mechanism☆147Updated 6 years ago
- Codes for "Towards Binary-Valued Gates for Robust LSTM Training".☆76Updated 6 years ago
- Sequence-to-Sequence learning using PyTorch☆522Updated 5 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆148Updated 5 years ago
- Implementation of Universal Transformer in Pytorch☆259Updated 6 years ago
- A pytorch implementation of the paper: "Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks"☆80Updated 6 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆407Updated 7 months ago
- Learning deep representations by mutual information estimation and maximization☆324Updated 6 years ago
- PyTorch implementation of batched bi-RNN encoder and attention-decoder.☆279Updated 6 years ago
- NEG loss implemented in pytorch☆124Updated 7 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆243Updated 5 years ago
- ☆83Updated 5 years ago
- Deep InfoMax (DIM), or "Learning Deep Representations by Mutual Information Estimation and Maximization"☆807Updated 5 years ago