mbforbes / py-pomdp
A small tool to parse a POMDP and load it into Python objects.
☆37 · Updated 5 years ago
Alternatives and similar repositories for py-pomdp
Users who are interested in py-pomdp are comparing it to the libraries listed below.
- A platform of grid world that supports up to 1 million reinforcement-learning agents. ☆69 · Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN). ☆57 · Updated 8 years ago
- POMDPs in Python. ☆251 · Updated 6 years ago
- Hybrid Reward Architecture ☆78 · Updated 7 years ago
- This is my implementation of the Optimality Tightening ☆37 · Updated 8 years ago
- Some Reinforcement Learning in Python ☆116 · Updated 8 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning". ☆35 · Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆153 · Updated 8 years ago
- ☆98 · Updated 9 years ago
- Implementation of the Model-Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460 ☆55 · Updated 9 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning ☆94 · Updated 7 years ago
- some common TD Learning algorithms ☆66 · Updated 5 years ago
- ☆66 · Updated last year
- ☆159 · Updated 8 years ago
- reinforcement learning. policy gradient. PCL ☆37 · Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆80 · Updated 7 years ago
- learning to play atari games with reinforcement learning ☆10 · Updated 9 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Updated 2 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆132 · Updated 6 years ago
- Benchmarking Canonical Evolution Strategies for Playing Atari ☆82 · Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction ☆80 · Updated 6 years ago
- Distributed implementation of popular evolutionary methods ☆64 · Updated 7 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆34 · Updated 9 years ago
- Ranking Policy Gradient ☆23 · Updated 5 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u… ☆52 · Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Updated 7 years ago
- ☆43 · Updated 8 years ago
- An implementation of Deep Q-Network using Caffe ☆71 · Updated 10 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow ☆192 · Updated 7 years ago
- Train an RL agent to play multiple Atari games at once ☆69 · Updated 9 years ago