JM-Kim-94 / rl-pendulumLinks
Open AI Gym - Pendulum-v1 reinforcement learning (DQN, SAC)
☆19Updated last year
Alternatives and similar repositories for rl-pendulum
Users that are interested in rl-pendulum are comparing it to the libraries listed below
Sorting:
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆27Updated 3 weeks ago
- The implementation of LSTM-TD3.☆81Updated 2 years ago
- A Reinforcement Learning Project using PPO + LSTM☆80Updated last year
- Implementation of Pareto Deep Q Networks in a multi-objective Gym Reinforcement Learning Environment☆16Updated last year
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆34Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆98Updated 3 weeks ago
- Heterogeneous Multi-Robot Reinforcement Learning☆48Updated 8 months ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)☆21Updated 5 years ago
- ☆40Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆77Updated last month
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆146Updated last year
- Learning multi-agent policies for flocking using graph neural networks☆76Updated 2 years ago
- 深度强化学习各算法介绍与Pytorch实现☆54Updated 10 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆89Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- TD3 in Pytorch☆34Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆65Updated 8 months ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆30Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆29Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆18Updated last year
- ☆184Updated 3 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆27Updated 2 years ago