AndyYue1893 / reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
☆30Updated 5 years ago
Alternatives and similar repositories for reinforcement-learning-an-introduction
Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- ☆124Updated 3 years ago
- ☆46Updated last month
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- rl-papers☆47Updated 2 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆169Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space.☆145Updated 11 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆343Updated last month
- ☆165Updated last year
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆42Updated 4 years ago
- A collection of multi agent environments based on OpenAI gym.☆23Updated last year
- Source Code☆182Updated last year
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated last week
- Transformer in RL for decision-making☆97Updated 2 years ago
- General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.☆255Updated 2 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- OpenAI团队的深度强化学习教程中文版☆29Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch☆45Updated 2 years ago
- ☆103Updated 3 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆75Updated last month
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- ☆60Updated last week
- ☆72Updated last year
- Hello😜☆31Updated 4 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- A plotter for reinforcement learning (RL)☆223Updated 3 years ago
- ☆27Updated 4 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆72Updated 5 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year