A simple, continuous-control environment for OpenAI Gym
☆23Jan 1, 2023Updated 3 years ago
Alternatives and similar repositories for gym-cartpole-swingup
Users that are interested in gym-cartpole-swingup are comparing it to the libraries listed below
Sorting:
- MuJoCo Models for Personal Robot 2 (PR2)☆11Aug 25, 2018Updated 7 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- A PyTorch implementation of BCO☆12Jun 19, 2023Updated 2 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 2 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Natural Environment Benchmarks for Reinforcement Learning☆23May 9, 2019Updated 6 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Oct 19, 2022Updated 3 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- ☆59Sep 22, 2022Updated 3 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Python DSL for writing PDDL☆25Aug 13, 2021Updated 4 years ago
- Differential Dynamic Programming controller operating in OpenAI Gym environment.☆87Jun 11, 2020Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019☆20Feb 25, 2020Updated 6 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- ☆25Apr 29, 2023Updated 2 years ago
- Official Code for "Relative Entropy Pathwise Policy Optimization"☆46Feb 27, 2026Updated last week
- ☆24Aug 9, 2024Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated this week
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Nov 14, 2024Updated last year
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 2 years ago
- Source for Action Schema Networks paper (AAAI'18)☆32Apr 6, 2023Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 5 years ago
- ☆81Jul 8, 2022Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆84Jul 27, 2022Updated 3 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆87Oct 15, 2023Updated 2 years ago
- Whole body Inverse Kinematics based on Pinocchio and qpmad☆42Dec 31, 2023Updated 2 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆228May 19, 2024Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆42Jun 6, 2024Updated last year
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago