facebookresearch / WhereDidMyOptimumGo
An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018
☆15Updated 5 years ago
Related projects: ⓘ
- TargetProp for RNNs☆28Updated 5 years ago
- Separating value functions across time-scales.☆17Updated 5 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 5 years ago
- Berg - Run GPU-backed experiments on gcloud☆25Updated 5 years ago
- ☆70Updated this week
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 6 years ago
- Backprop training of recurrent neural networks with Hebbian plastic connections☆20Updated 3 years ago
- Deterministic Policy Gradient using torch7☆44Updated 8 years ago
- Code to build VAE models that are jointly conditioned.☆35Updated 6 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆43Updated 8 years ago
- Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger☆31Updated 3 years ago
- E2C implementation in PyTorch☆43Updated 7 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- Cephes Mathematical Functions library wrapped for Torch☆47Updated 8 years ago
- Simple PuddleWorld DQN example using torch7☆29Updated 8 years ago
- ☆38Updated 7 years ago
- ☆25Updated 5 years ago
- ☆17Updated 7 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Cluttered MNIST Dataset☆50Updated 9 years ago
- ☆42Updated 5 years ago
- A very simple variant of adversarial training that yields excellent results on MNIST☆12Updated 8 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆43Updated 7 years ago
- ☆57Updated 6 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 7 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Updated 5 years ago
- ☆29Updated 7 years ago
- RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Base…☆38Updated 6 years ago
- Easing non-convex optimization with neural networks.☆22Updated 6 years ago