timoklein / alphazero-gymLinks
AlphaZero for continuous control tasks
☆23Updated 3 years ago
Alternatives and similar repositories for alphazero-gym
Users that are interested in alphazero-gym are comparing it to the libraries listed below
Sorting:
- A collection of RL algorithms written in JAX.☆104Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- Modular framework for Reinforcement Learning in python☆180Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆225Updated last year
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆151Updated 4 years ago
- Benchmarking RL generalization in an interpretable way.☆173Updated last month
- ☆323Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆78Updated 3 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Updated 3 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Updated 2 years ago
- Baselines for gymnax 🤖☆73Updated 2 years ago
- Code for the paper "Batch size invariance for policy optimization"☆56Updated 2 years ago
- ☆32Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190Updated 3 years ago
- JAX implementations of core Deep RL algorithms☆82Updated 3 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 3 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"