AndyYue1893 / reinforcement-learning-an-introductionLinks
Python Implementation of Reinforcement Learning: An Introduction
☆30Updated 6 years ago
Alternatives and similar repositories for reinforcement-learning-an-introduction
Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below
Sorting:
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 6 years ago
- ☆126Updated 4 years ago
- rl-papers☆48Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 8 months ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆44Updated 5 years ago
- Source Code☆210Updated last year
- ☆54Updated 4 months ago
- ☆315Updated 3 years ago
- A plotter for reinforcement learning (RL)☆234Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆174Updated 2 years ago
- ☆105Updated 3 months ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.☆274Updated last week
- GitHub's code repository is all you need☆355Updated 2 years ago
- Transformer in RL for decision-making☆102Updated 2 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆404Updated 3 months ago
- OpenAI团队的深度强化学习教程中文版☆31Updated 5 years ago
- ☆171Updated 2 years ago
- basic algorithms of reinforcement learning☆214Updated 2 years ago
- RL-code for beginners. Enjoying!☆115Updated 5 years ago
- ☆90Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆94Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆165Updated last year
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆155Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆119Updated 3 years ago
- Code for running RL experiments on continuing (non-episodic) problems.☆20Updated 2 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆89Updated 5 years ago
- NeurIPS 2024 DACER☆145Updated 3 weeks ago
- ☆16Updated 3 years ago
- ☆66Updated last year