AadityaRavindran / gym-cartpolemod
Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller
☆9 Updated 6 years ago
Related projects
Alternatives and complementary repositories for gym-cartpolemod
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 Updated 3 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning ☆30 Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 4 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆35 Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- Prioritized Sequence Experience Replay ☆10 Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- ☆17 Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆30 Updated this week
- A repository for code of reinforcement learning algorithms with PyTorch ☆29 Updated 3 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial. ☆37 Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R… ☆18 Updated 3 years ago
- A modified version of the cart-pole OpenAI Gym environment for testing different control policies ☆13 Updated 4 months ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆24 Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆42 Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆52 Updated 5 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment ☆15 Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆66 Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆23 Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- POMDP wrappers for OpenAI Gym ☆15 Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆48 Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms) ☆10 Updated 5 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆36 Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated last year
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆23 Updated 5 years ago
- ☆28 Updated last year
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆34 Updated 2 years ago