d3sm0 / gym_pomdp
Gym-like extensions for POMDP
☆55Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for gym_pomdp
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- A curated list of awesome Model-based reinforcement learning resources☆90Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆41Updated last year
- Implementation of the Option-Critic Architecture☆36Updated 5 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆32Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆77Updated 11 months ago
- ☆97Updated last year
- Gym environments modified with adversarial agents☆35Updated 7 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆40Updated 2 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.☆77Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019☆20Updated 4 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- ☆30Updated 11 months ago
- Formulating Model-based RL Dynamics as a continuous rather then one step prediction☆35Updated 2 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆55Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆150Updated 2 weeks ago
- ☆82Updated 5 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- A standalone library to randomize various OpenAI Gym Environments☆60Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 4 years ago
- Working directory for dynamics learning for experimental robots.☆55Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆31Updated last year
- ☆90Updated 11 months ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆154Updated 2 years ago