facebookresearch / adaptive-softmax
Implements the efficient softmax approximation described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/1609.04309).
☆395 · Updated 5 years ago
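For context, PyTorch now ships a reimplementation of this technique as `torch.nn.AdaptiveLogSoftmaxWithLoss`; the repository above is the original Torch/Lua code. Below is a minimal sketch of the idea using that module. The vocabulary size, cutoffs, and dimensions are illustrative assumptions, not values from the paper or the repository.

```python
# Minimal sketch of adaptive softmax via PyTorch's built-in
# nn.AdaptiveLogSoftmaxWithLoss (not code from this repository).
import torch
import torch.nn as nn

hidden_dim = 512           # size of the hidden state fed into the softmax (assumed)
vocab_size = 100_000       # large output vocabulary, sorted by word frequency (assumed)
cutoffs = [2_000, 20_000]  # frequency-based cluster boundaries: head + two tails (assumed)

# Frequent words stay in the "head" cluster and are scored at full
# dimensionality; rare words are pushed into low-rank tail clusters,
# which is where the compute and memory savings come from.
adaptive_softmax = nn.AdaptiveLogSoftmaxWithLoss(
    in_features=hidden_dim,
    n_classes=vocab_size,
    cutoffs=cutoffs,
    div_value=4.0,         # tail k projects down to hidden_dim / 4**k dimensions
)

hidden = torch.randn(32, hidden_dim)           # a batch of 32 hidden states
targets = torch.randint(0, vocab_size, (32,))  # gold next-word indices

out = adaptive_softmax(hidden, targets)
print(out.output.shape)  # (32,): log-probability of each example's target word
print(out.loss)          # mean negative log-likelihood over the batch
```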
Alternatives and similar repositories for adaptive-softmax:
Users interested in adaptive-softmax are comparing it to the libraries listed below.
- Mixed Incremental Cross-Entropy REINFORCE (ICLR 2016) ☆332 · Updated 7 years ago
- Code and models from the paper "Layer Normalization" ☆245 · Updated 8 years ago
- ☆165 · Updated 8 years ago
- Language Modeling ☆156 · Updated 5 years ago
- Self-contained software accompanying the paper "Learning Longer Memory in Recurrent Neural Networks": http://arxiv.org/ab… ☆169 · Updated 6 years ago
- Auto-tuning momentum SGD optimizer ☆287 · Updated 5 years ago
- Benchmarks for several RNN variations with different deep-learning frameworks ☆169 · Updated 5 years ago
- ByteNet for character-level language modelling ☆319 · Updated 7 years ago
- Recurrent Highway Networks - Implementations for TensorFlow, Torch7, Theano and Brainstorm ☆404 · Updated 5 years ago
- This library provides utilities for creating and manipulating RNNs to model sequential data. ☆192 · Updated 7 years ago
- ☆395 · Updated 6 years ago
- ☆168 · Updated 8 years ago
- ☆144 · Updated 7 years ago
- Tools for PyTorch ☆221 · Updated 2 years ago
- ☆617 · Updated 7 years ago
- TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks" ☆273 · Updated 7 years ago
- OptNet - Reducing memory usage in Torch neural nets ☆282 · Updated 7 years ago
- Cleaned original source code from my NIPS publication ☆154 · Updated 7 years ago
- Standalone TensorBoard for visualization in deep learning ☆371 · Updated 4 years ago
- ☆473 · Updated 2 years ago
- Adaptive Computation Time algorithm in TensorFlow ☆255 · Updated 7 years ago
- Write PyTorch code at the level of individual examples, then run it efficiently on minibatches. ☆484 · Updated 2 years ago
- Multi-GPU mini-framework for Theano ☆195 · Updated 7 years ago
- nmtpy is a Python framework based on dl4mt-tutorial to experiment with Neural Machine Translation pipelines. ☆126 · Updated 6 years ago
- Code for Structured Attention Networks (https://arxiv.org/abs/1702.00887) ☆238 · Updated 7 years ago
- C++/CUDA toolkit for training sequence and sequence-to-sequence models across multiple GPUs ☆186 · Updated 7 years ago
- Examples and scripts using Blocks ☆147 · Updated 8 years ago
- A Chainer implementation of the Transformer from "Attention Is All You Need" (Vaswani et al., 2017) ☆314 · Updated 7 years ago
- Dynamic evaluation for PyTorch language models, now including hyperparameter tuning ☆105 · Updated 7 years ago
- A Neural Turing Machine implementation in Torch ☆279 · Updated 9 years ago