stefbraun / rnn_benchmarks
RNN benchmarks of pytorch, tensorflow and theano
☆87Updated 6 years ago
Related projects: ⓘ
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆76Updated 5 years ago
- ☆38Updated this week
- Collection of TensorFlow Examples☆37Updated 6 years ago
- Compare outputs between layers written in Tensorflow and layers written in Pytorch☆72Updated 6 years ago
- A set of simple examples ported from PyTorch for Tensorflow Eager Execution☆73Updated 6 years ago
- Language Modeling☆156Updated 5 years ago
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow.☆101Updated 5 years ago
- Jupyter notebook running through basic examples of Distributed TensorFlow☆71Updated 2 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 4 years ago
- ☆112Updated this week
- Python library for extracting mini-batches of data from a data source for the purpose of training neural networks☆86Updated 5 years ago
- tf.keras + tf.data with Eager Execution☆74Updated 5 years ago
- Training RNNs as Fast as CNNs (Simple Recurrent Unit)☆30Updated 6 years ago
- Corrupted labels and label smoothing☆128Updated 6 years ago
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆122Updated 5 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆55Updated 6 years ago
- Model Compression Based on Geoffery Hinton's Logit Regression Method in Keras applied to MNIST 16x compression over 0.95 percent accuracy…☆62Updated 5 years ago
- ☆75Updated 7 years ago
- Keras implementation of Nested LSTMs☆90Updated 5 years ago
- Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.☆153Updated 5 years ago
- Sparse and structured neural attention mechanisms☆224Updated 4 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆47Updated 6 years ago
- Maxout Networks TensorFlow implementation presented in https://arxiv.org/abs/1302.4389☆56Updated 5 years ago
- A smoother activation function (undergrad code)☆104Updated 4 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 5 years ago
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 6 years ago
- Multi-GPU data-parallel training in Keras☆77Updated 6 years ago
- ☆105Updated this week
- Use TensorFlow efficiently☆95Updated 3 years ago
- Machine-generated summaries and highlights of the every accepted paper at Thirty-second Conference on Neural Information Processing Syste…☆70Updated 5 years ago