AndyYue1893 / reinforcement-learning-an-introductionLinks
Python Implementation of Reinforcement Learning: An Introduction
☆31Updated 5 years ago
Alternatives and similar repositories for reinforcement-learning-an-introduction
Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below
Sorting:
- Source Code☆188Updated last year
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- ☆124Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆129Updated 4 months ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆43Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆172Updated last year
- ☆51Updated 3 weeks ago
- ☆75Updated last year
- ☆103Updated 4 months ago
- ☆16Updated 2 years ago
- rl-papers☆47Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆154Updated last year
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆198Updated 8 months ago
- ☆311Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆168Updated 7 months ago
- ☆27Updated 4 years ago
- ☆165Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆55Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- A collection of offline reinforcement learning algorithms.☆189Updated 7 months ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆78Updated 2 years ago
- RLlib超参数详解(中文)☆18Updated 3 years ago
- Transformer in RL for decision-making☆96Updated 2 years ago
- Hello😜☆31Updated 4 years ago
- ☆63Updated last month
- ☆154Updated 6 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆177Updated last year
- NeurIPS 2024 DACER☆120Updated 3 weeks ago