leimao / Two-Layer-Hierarchical-Softmax-PyTorchLinks

Two-Layer Hierarchical Softmax Implementation for PyTorch

☆69

Alternatives and similar repositories for Two-Layer-Hierarchical-Softmax-PyTorch

Users that are interested in Two-Layer-Hierarchical-Softmax-PyTorch are comparing it to the libraries listed below

Sorting:

rdspring1 / PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
☆123Updated 5 years ago
lancopku / Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
☆85Updated 2 years ago
tanyuqian / learning-data-manipulation
NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting
☆109Updated 4 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85Updated 6 years ago
rosinality / adaptive-softmax-pytorch
Adaptive Softmax implementation for PyTorch
☆81Updated 6 years ago
google-deepmind / lamb
LAnguage Modelling Benchmarks
☆138Updated 5 years ago
kefirski / pytorch_Highway
Highway network implemented in pytorch
☆80Updated 8 years ago
harvardnlp / var-attn
Latent Alignment and Variational Attention
☆327Updated 6 years ago
bplank / semi-supervised-baselines
Code for "Strong Baselines for Neural Semi-supervised Learning under Domain Shift" (Ruder & Plank, 2018 ACL)
☆61Updated 2 years ago
mttk / rnn-classifier
Minimal RNN classifier with self-attention in Pytorch
☆150Updated 3 years ago
BangLiu / QANet-PyTorch
Re-implement "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension"
☆120Updated 6 years ago
whr94621 / NJUNMT-pytorch
☆93Updated 3 years ago
serrano-s / attn-tests
Checking the interpretability of attention on text classification models
☆49Updated 6 years ago
asiddhant / Active-NLP
Bayesian Deep Active Learning for Natural Language Processing Tasks
☆147Updated 6 years ago
FreedomIntelligence / complex-order
☆83Updated 5 years ago
salesforce / nonauto-nmt
PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"
☆271Updated 3 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
yzh119 / BPT
Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
☆128Updated 4 years ago
fanglanting / skip-gram-pytorch
A complete pytorch implementation of skip-gram
☆191Updated 7 years ago
ottokart / beam_search
Beam search for neural network sequence to sequence (encoder-decoder) models.
☆34Updated 6 years ago
nadavbh12 / Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch
Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
☆61Updated 6 years ago
takase / control-length
☆53Updated 4 years ago
thomlake / pytorch-attention
pytorch neural network attention mechanism
☆147Updated 6 years ago
blackredscarf / pytorch-SkipGram
Implementing Skip-gram Negative Sampling with pytorch
☆49Updated 6 years ago
Cheneng / HiararchicalAttentionGRU
Hierarchical Attention Networks for Document Classification in PyTorch
☆36Updated 6 years ago
keitakurita / Better_LSTM_PyTorch
An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.
☆134Updated 5 years ago
jojonki / Gated-Convolutional-Networks
A PyTorch implementation of : Language Modeling with Gated Convolutional Networks.
☆99Updated 3 years ago
guxd / DialogWAE
Source Code for DialogWAE: Multimodal Response Generation with Conditional Wasserstein Autoencoder (https://arxiv.org/abs/1805.12352)
☆125Updated 6 years ago
Stonesjtu / Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
☆319Updated 5 years ago
kaushalshetty / Positional-Encoding
Encoding position with the word embeddings.
☆83Updated 7 years ago