The implement of the policy gradient RL algorithm with pytorch
☆40Dec 7, 2020Updated 5 years ago
Alternatives and similar repositories for policy_based_RL
Users that are interested in policy_based_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Dec 20, 2018Updated 7 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆24Aug 14, 2019Updated 6 years ago
- The implement of GAIL with pytorch☆14Mar 11, 2020Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Aug 2, 2020Updated 5 years ago
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆13Oct 23, 2020Updated 5 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆25Oct 22, 2022Updated 3 years ago
- ☆18Aug 14, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data☆11Sep 7, 2023Updated 2 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆18Sep 14, 2025Updated 7 months ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- 基于强化学习的游戏空战推演☆13May 8, 2021Updated 4 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- Generate Micro-Doppler signature of human motion by radar☆12Jul 2, 2023Updated 2 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- ☆69Nov 30, 2018Updated 7 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆48Nov 30, 2018Updated 7 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆42Dec 8, 2022Updated 3 years ago
- This is codes of PTDE algorithms.☆14Jun 18, 2024Updated last year
- Implementation Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm in keras☆21Dec 19, 2023Updated 2 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 2 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This is official code for ASFL.☆22Mar 3, 2025Updated last year
- ☆17Oct 25, 2023Updated 2 years ago
- ☆14May 30, 2019Updated 6 years ago
- ☆11Oct 26, 2022Updated 3 years ago
- ☆19Nov 21, 2023Updated 2 years ago
- Using DDPG agent to control UAV system with energy efficiency☆16Jan 7, 2023Updated 3 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated last month