AadityaRavindran / gym-cartpolemod
Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller
☆9Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for gym-cartpolemod
- A modified version of the cart-pole OpenAI Gym environment for testing different control policies☆13Updated 3 months ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆53Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆23Updated 5 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆25Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 5 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆17Updated 3 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆66Updated 4 years ago
- Implementation of the Option-Critic Architecture☆36Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆25Updated 5 years ago
- ☆16Updated 3 years ago
- ☆81Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- ☆90Updated 11 months ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆15Updated 6 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆27Updated 3 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆37Updated last year
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 4 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆87Updated 3 months ago
- ☆33Updated 4 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago