michaelfarrell76 / Distributed-SGD
Parallel SGD, done locally and remote
☆14Updated 8 years ago
Alternatives and similar repositories for Distributed-SGD:
Users that are interested in Distributed-SGD are comparing it to the libraries listed below
- code for our IJCAI 2018 paper : "Lifelong Domain Word Embedding via Meta-Learning"☆11Updated 6 years ago
- Proximal Asynchronous SAGA☆12Updated 7 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Updated 2 years ago
- ☆19Updated 6 years ago
- Professor Forcing, NIPS'16☆45Updated 8 years ago
- Repo for code for the NIPS paper entitled "An Architecture for Deep, Hierarchical Generative Models"☆14Updated 8 years ago
- Variational autoencoder in Theano☆12Updated 7 years ago
- Codes for the "Noisy Activation Functions" paper.☆17Updated 8 years ago
- Char RNN Language Model based on Tensorflow☆12Updated 8 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Updated 6 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26Updated 7 years ago
- RWA in pytorch☆14Updated 7 years ago
- Implementing FastSent in theano☆12Updated 8 years ago
- Conditional Random Fields implemented as Lasagne layer☆10Updated 8 years ago
- Generalized Compressed Network Search with PyTorch☆26Updated 7 years ago
- Deep generative model for sentiment analysis☆34Updated 8 years ago
- ☆38Updated 7 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11Updated 9 years ago
- Question Dependent Recurrent Entity Network☆13Updated 7 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Updated 6 years ago
- Automatically build the deep learning models with ENAS☆31Updated 6 years ago
- Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019☆11Updated 2 years ago
- ☆13Updated 7 years ago
- Implementation of QA Networks☆10Updated 8 years ago
- a replicate of https://arxiv.org/pdf/1711.00937.pdf☆16Updated 7 years ago
- numpy implementation of net 2 net from the paper Net2Net: Accelerating Learning via Knowledge Transfer http://arxiv.org/abs/1511.05641☆53Updated 8 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Updated 8 years ago
- Low-rank Highway Networks☆13Updated 9 years ago
- Code of "Max-margin Deep Generative Models" (NIPS15)☆18Updated 9 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago