JM-Kim-94 / rl-pendulum
OpenAI Gym - Pendulum-v1 reinforcement learning (DQN, SAC)
☆19 Updated last year
Alternatives and similar repositories for rl-pendulum:
Users who are interested in rl-pendulum are comparing it to the repositories listed below.
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. ☆26 Updated 8 months ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R… ☆22 Updated 3 years ago
- A Reinforcement Learning Project using PPO + LSTM ☆64 Updated last year
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018) ☆21 Updated 5 years ago
- A clean and robust PyTorch implementation of TD3 on continuous action space ☆26 Updated 9 months ago
- Deep recurrent Q-learning on CartPole-v1 environment ☆87 Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc… ☆27 Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆83 Updated last year
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf ☆31 Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆103 Updated 4 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R… ☆15 Updated 2 years ago
- Transformer in RL for decision-making ☆96 Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆138 Updated 11 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆49 Updated last month
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch ☆24 Updated 2 years ago
- A clean and robust PyTorch implementation of SAC on continuous action space ☆70 Updated 9 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆165 Updated 11 months ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆71 Updated 2 years ago
- ☆40 Updated 3 years ago
- Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand" ☆77 Updated 3 years ago
- Learning multi-agent policies for flocking using graph neural networks ☆74 Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆60 Updated 9 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆68 Updated 8 months ago
- The implementation of LSTM-TD3. ☆77 Updated 2 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multi-agent Unity environment. ☆35 Updated 4 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning ☆22 Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆39 Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch ☆38 Updated 2 years ago
- Implementation for mSAC methods in PyTorch ☆41 Updated 3 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆123 Updated 11 months ago