arunvenk / DaD
Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. dynamical system) model.
☆33Updated 8 years ago
Alternatives and similar repositories for DaD:
Users that are interested in DaD are comparing it to the libraries listed below
- ZForcing Repo☆40Updated 7 years ago
- Python implementation of the PR-SSM.☆51Updated 6 years ago
- Github page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Updated 4 years ago
- State space modeling with recurrent neural networks☆45Updated 7 years ago
- Differentiable Neural Computer in TensorFlow☆26Updated 8 years ago
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆40Updated 6 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Updated 6 years ago
- PyTorch implementation of AVF☆45Updated 4 years ago
- ☆12Updated 6 years ago
- Variational Recurrent Auto-Encoder using LSTM encoder/decoder networks☆54Updated 8 years ago
- ☆62Updated 7 years ago
- ☆16Updated 8 years ago
- ☆68Updated 6 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 6 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting.☆37Updated 6 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 4 years ago
- Variational Autoencoders with Gaussian Mixture Latent Space☆36Updated 7 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…☆25Updated 6 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 9 years ago
- Framework of DataLog Neural Program Synthesis☆26Updated 6 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick.☆32Updated 8 years ago
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"☆27Updated 4 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- Z Forcing: Training Stochastic RNN's, NIPS'17☆32Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago