0xangelo / gym-cartpole-swingup
A simple, continuous-control environment for OpenAI Gym
☆23 · Jan 1, 2023 · Updated 3 years ago
Alternatives and similar repositories for gym-cartpole-swingup
Users who are interested in gym-cartpole-swingup are comparing it to the libraries listed below.
- Distributed Prioritized Experience Replay ☆10 · Aug 8, 2018 · Updated 7 years ago
- MuJoCo Models for Personal Robot 2 (PR2) ☆11 · Aug 25, 2018 · Updated 7 years ago
- Reinforcement learning in pure JAX. ☆13 · Dec 24, 2025 · Updated last month
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 5 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning ☆12 · Aug 31, 2020 · Updated 5 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆20 · Oct 6, 2021 · Updated 4 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆79 · Nov 19, 2022 · Updated 3 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆22 · Oct 3, 2022 · Updated 3 years ago
- Natural Environment Benchmarks for Reinforcement Learning ☆23 · May 9, 2019 · Updated 6 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch. ☆19 · Jun 15, 2018 · Updated 7 years ago
- Learning Robust Dynamics Through Variational Sparse Gating ☆20 · Oct 19, 2022 · Updated 3 years ago
- ☆59 · Sep 22, 2022 · Updated 3 years ago
- Python DSL for writing PDDL ☆25 · Aug 13, 2021 · Updated 4 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. ☆21 · Sep 18, 2020 · Updated 5 years ago
- Differential Dynamic Programming controller operating in OpenAI Gym environment. ☆87 · Jun 11, 2020 · Updated 5 years ago
- NeurIPS Reproducibility Challenge 2019 ☆20 · Feb 25, 2020 · Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Jul 18, 2023 · Updated 2 years ago
- Official Code for "Relative Entropy Pathwise Policy Optimization" ☆45 · Updated this week
- ☆25 · Apr 29, 2023 · Updated 2 years ago
- ☆24 · Aug 9, 2024 · Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Jan 7, 2026 · Updated last month
- Study Group of Model-based RL: a summary of slides from the Takahashi Lab's model-based reinforcement learning study group ☆25 · Jun 10, 2019 · Updated 6 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 · Nov 14, 2024 · Updated last year
- Source for Action Schema Networks paper (AAAI'18) ☆32 · Apr 6, 2023 · Updated 2 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆24 · May 30, 2019 · Updated 6 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆25 · Apr 15, 2023 · Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 · Jun 30, 2020 · Updated 5 years ago
- Disagreement-Regularized Imitation Learning ☆30 · May 25, 2021 · Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning ☆27 · May 15, 2020 · Updated 5 years ago
- ☆81 · Jul 8, 2022 · Updated 3 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆85 · Oct 15, 2023 · Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆83 · Jul 27, 2022 · Updated 3 years ago
- Whole body Inverse Kinematics based on Pinocchio and qpmad ☆40 · Dec 31, 2023 · Updated 2 years ago
- Model-based Policy Gradients ☆32 · Mar 12, 2020 · Updated 5 years ago
- A TensorFlow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 · Oct 12, 2017 · Updated 8 years ago
- COMS30017 Computational Neuroscience ☆11 · Jan 7, 2022 · Updated 4 years ago
- TD-Regularized Actor-Critic Methods ☆36 · Dec 26, 2019 · Updated 6 years ago