AndyYue1893 / Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
☆28 Updated 5 years ago
Alternatives and similar repositories for Hands-On-Reinforcement-Learning-With-Python
Users that are interested in Hands-On-Reinforcement-Learning-With-Python are comparing it to the libraries listed below
- Python Implementation of Reinforcement Learning: An Introduction ☆31 Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆90 Updated 2 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and .... ☆52 Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic ☆129 Updated 5 months ago
- My DRL library with TensorFlow 1.14, based on OpenAI Spinning Up ☆62 Updated 4 years ago
- RL code for beginners. Enjoy! ☆115 Updated 5 years ago
- Intelligent control algorithm and simulation environment. ☆17 Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆91 Updated last year
- Multi-agent reinforcement learning ☆100 Updated 6 years ago
- Detailed explanation of RLlib hyperparameters (in Chinese) ☆18 Updated 3 years ago
- ☆124 Updated 3 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more. ☆44 Updated 4 years ago
- Implementations of many sparse-reward algorithms in the Gym Fetch environment ☆88 Updated 5 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆50 Updated 6 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with few dependencies. ☆102 Updated 3 years ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning” ☆24 Updated 2 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆159 Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 5 months ago
- Implement reinforcement learning algorithms in PyTorch ☆33 Updated 4 years ago
- BipedalWalker & BipedalWalkerHardcore solved by SAC ☆25 Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆58 Updated 2 years ago
- TD3 in PyTorch ☆34 Updated 3 years ago
- Reinforcement learning interview (to be continued) ☆35 Updated 5 years ago
- An implementation of the policy gradient RL algorithm in PyTorch ☆39 Updated 4 years ago
- A collection of multi-agent environments based on OpenAI Gym. ☆26 Updated last year
- Benchmark of existing methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ… ☆32 Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆181 Updated last year
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial ☆31 Updated 5 years ago
- A port of a MADDPG implementation to the environment provided by OpenAI (multiagent-particle-envs). ☆21 Updated 4 years ago
- My internship project in CASIA. 🤗 ☆4 Updated 6 years ago