0xangelo / gym-cartpole-swingup
A simple, continuous-control environment for OpenAI Gym
☆21 Updated last year
Related projects
Alternatives and complementary repositories for gym-cartpole-swingup
- A collection of RL algorithms written in JAX. ☆94 Updated 2 years ago
- ☆34 Updated last year
- Revisiting Rainbow ☆73 Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆50 Updated 3 years ago
- a modular reinforcement learning library with JAX agents ☆22 Updated 11 months ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆12 Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 Updated last year
- Model-based reinforcement learning in TensorFlow ☆53 Updated 3 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021) ☆51 Updated 3 years ago
- ☆42 Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆24 Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆34 Updated 2 years ago
- ☆47 Updated 4 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆61 Updated 4 months ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆143 Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆66 Updated 5 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies ☆20 Updated 3 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆31 Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- ☆30 Updated 11 months ago
- PyTorch implementation of Stochastic Latent Actor-Critic (SLAC). ☆87 Updated 3 months ago
- RL-Toolkit: A Research Framework for Robotics ☆18 Updated 7 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆22 Updated 7 months ago
- ☆28 Updated last year
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆28 Updated 5 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago