ramp-kits / rl_simulatorLinks
Model-based reinforcement learning (generative simulator models and planning agents)
☆16Updated 4 years ago
Alternatives and similar repositories for rl_simulator
Users that are interested in rl_simulator are comparing it to the libraries listed below
Sorting:
- Model-based reinforcement learning in TensorFlow☆56Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated last week
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Updated 2 years ago
- ☆35Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Updated last year
- Mirror Descent Policy Optimization☆41Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- ☆30Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- My Body Is A Cage☆41Updated 4 years ago
- ☆32Updated 2 years ago
- ☆23Updated 3 years ago
- ☆54Updated last year
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- ☆99Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆17Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Updated 2 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)☆49Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization☆15Updated 5 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 3 years ago
- Proximal Policy Option-Critic☆26Updated 6 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 3 years ago
- ☆15Updated 5 years ago