arunvenk / DaD
Data as Demonstrator (DaD) is a meta-learning algorithm that improves the multi-step predictive capabilities of a learned time-series (e.g., dynamical system) model.
☆33 · Updated 8 years ago
Alternatives and similar repositories for DaD:
Users who are interested in DaD are comparing it to the libraries listed below.
- ZForcing Repo ☆40 · Updated 7 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ) ☆48 · Updated 6 years ago
- Ranking Policy Gradient ☆23 · Updated 5 years ago
- Contextual Bandits Action Elimination DQN ☆21 · Updated 6 years ago
- GitHub page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders" ☆14 · Updated 4 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 · Updated 6 years ago
- Implementation of Neural Episodic Control in TensorFlow ☆26 · Updated 5 years ago
- Bayesian Backprop RNN implementation in PyTorch (https://arxiv.org/abs/1704.02798) ☆25 · Updated 7 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar… ☆25 · Updated 7 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆37 · Updated 6 years ago
- Differentiable Neural Computer in TensorFlow ☆26 · Updated 8 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat… ☆27 · Updated 5 years ago
- Source code of the neural Hawkes particle smoothing (ICML 2019) ☆43 · Updated 5 years ago
- Example implementation of the Bayesian neural network in "Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteri… ☆30 · Updated 4 years ago
- Python implementation of the PR-SSM. ☆51 · Updated 6 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders. ☆43 · Updated 7 years ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆32 · Updated 7 years ago
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees" ☆28 · Updated 4 years ago
- Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks ☆65 · Updated 4 years ago
- Code accompanying the OptionGAN paper. ☆44 · Updated 6 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products. ☆40 · Updated 6 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick. ☆32 · Updated 8 years ago
- Z Forcing: Training Stochastic RNNs, NIPS'17 ☆32 · Updated 7 years ago
- Summaries and minimal implementations of ML / statistics research articles. ☆39 · Updated 4 years ago
- PyTorch implementation of AVF ☆45 · Updated 4 years ago