An implementation of the policy gradient RL algorithm with PyTorch
☆40 · Dec 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for policy_based_RL
Users that are interested in policy_based_RL are comparing it to the libraries listed below
- PyTorch implementation of intrinsic curiosity module with proximal policy optimization ☆55 · Dec 20, 2018 · Updated 7 years ago
- An implementation of all kinds of DQN reinforcement learning with PyTorch ☆96 · Mar 25, 2021 · Updated 4 years ago
- Code for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach" ☆15 · Aug 30, 2024 · Updated last year
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer ☆24 · Aug 14, 2019 · Updated 6 years ago
- Deep RL agents with PyTorch ☆36 · Sep 25, 2021 · Updated 4 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆147 · Jan 12, 2019 · Updated 7 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with PyTorch ☆10 · Aug 2, 2020 · Updated 5 years ago
- An implementation of GAIL with PyTorch ☆14 · Mar 11, 2020 · Updated 5 years ago
- Code for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism" ☆26 · Oct 22, 2022 · Updated 3 years ago
- Proximal Policy Optimization with Stein Control Variates: ☆33 · Feb 12, 2018 · Updated 8 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel… ☆10 · Jan 18, 2025 · Updated last year
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf ☆35 · Dec 8, 2022 · Updated 3 years ago
- 5GTANGO Smart Manufacturing Pilot ☆13 · May 1, 2023 · Updated 2 years ago
- Value-Decomposition Multi-Agent Actor-Critics ☆42 · Dec 8, 2022 · Updated 3 years ago
- AI path planning and controller for formations of drones. ☆14 · Apr 8, 2021 · Updated 4 years ago
- Code for the PTDE algorithms. ☆14 · Jun 18, 2024 · Updated last year
- Generate Micro-Doppler signature of human motion by radar ☆12 · Jul 2, 2023 · Updated 2 years ago
- Source code for SWIFT, an efficient reward model. ☆18 · Jan 13, 2026 · Updated last month
- Evolving Objects (EO): an Evolutionary Computation Framework ☆12 · Sep 9, 2016 · Updated 9 years ago
- Python package for the paper "Inductive Document Network Embedding with Topic-Word Attention" (https://arxiv.org/pdf/2001.03369.pdf) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Learning in Noisy MDP (which is governed by stochastic, exogenous input processes) with input-dependent baseline ☆11 · Aug 7, 2020 · Updated 5 years ago
- Python and C++ implementation of the Batch Informed Trees algorithm from scratch with real-time performance in R2 space ☆14 · Jun 4, 2023 · Updated 2 years ago
- ☆10 · Mar 3, 2020 · Updated 6 years ago
- Implementing DQNClipped and DQNReg Algorithms ☆10 · Mar 2, 2021 · Updated 5 years ago
- ☆17 · Feb 5, 2026 · Updated 3 weeks ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II ☆13 · May 16, 2019 · Updated 6 years ago
- Official code for ASFL. ☆21 · Mar 3, 2025 · Updated last year
- A project using PyTorch to implement reinforcement learning on a simple game, Gridworld ☆13 · Jul 13, 2020 · Updated 5 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆41 · Jan 28, 2019 · Updated 7 years ago
- NeurIPS 2024 ☆14 · Oct 29, 2025 · Updated 4 months ago
- Documentation: ☆13 · May 2, 2025 · Updated 10 months ago
- Multi-Critic Policy Gradient Optimization for Quadcopter Coordination ☆14 · Aug 10, 2021 · Updated 4 years ago
- ☆12 · Nov 29, 2022 · Updated 3 years ago
- PPO with Hindsight Experience Replay (HER) ☆11 · May 8, 2018 · Updated 7 years ago
- Reinforcement learning algorithm implementation ☆10 · Oct 31, 2021 · Updated 4 years ago
- The Soft Cosine Measure system developed for the ARQMath-3 shared task evaluation of math information retrieval systems ☆13 · Sep 8, 2022 · Updated 3 years ago
- Reimplementation of SALICON saliency model in TensorFlow ☆10 · Nov 22, 2022 · Updated 3 years ago
- Python implementation of state-of-the-art meta-heuristic and evolutionary optimization algorithms. ☆12 · Jun 29, 2022 · Updated 3 years ago