michaelfarrell76 / Distributed-SGD
Parallel SGD, done locally and remote
☆14Updated 8 years ago
Alternatives and similar repositories for Distributed-SGD:
Users that are interested in Distributed-SGD are comparing it to the libraries listed below
- code for our IJCAI 2018 paper : "Lifelong Domain Word Embedding via Meta-Learning"☆11Updated 5 years ago
- Automatically build the deep learning models with ENAS☆31Updated 6 years ago
- Code of "Max-margin Deep Generative Models" (NIPS15)☆18Updated 9 years ago
- ☆38Updated 7 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Updated 6 years ago
- Implementing FastSent in theano☆12Updated 8 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Updated last year
- ☆13Updated 7 years ago
- numpy implementation of net 2 net from the paper Net2Net: Accelerating Learning via Knowledge Transfer http://arxiv.org/abs/1511.05641☆53Updated 8 years ago
- Proximal Asynchronous SAGA☆12Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26Updated 7 years ago
- Gated Recurrent Unit with Low-rank matrix factorization☆34Updated 9 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- Variational autoencoder in Theano☆12Updated 7 years ago
- ☆19Updated 6 years ago
- Python implementation of the infomration bottleneck method (tishby et al, 1999)☆36Updated 7 years ago
- RWA in pytorch☆14Updated 7 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Updated 6 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated last year
- Char RNN Language Model based on Tensorflow☆12Updated 8 years ago
- Neural-based Noise Filtering from Word Embeddings☆11Updated 7 years ago
- Professor Forcing, NIPS'16☆46Updated 7 years ago
- ☆74Updated 5 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Updated 8 years ago
- Various experiments on the [Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist) dataset from Zalando☆31Updated 7 years ago
- A Keras inspired training utility for PyTorch☆38Updated 6 years ago
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Updated 6 years ago
- Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019☆11Updated 2 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆46Updated 7 years ago
- Low-rank Highway Networks☆13Updated 9 years ago