The implement of the policy gradient RL algorithm with pytorch
☆40Dec 7, 2020Updated 5 years ago
Alternatives and similar repositories for policy_based_RL
Users that are interested in policy_based_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Aug 15, 2020Updated 5 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆25Aug 14, 2019Updated 6 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆96Mar 25, 2021Updated 5 years ago
- The implement of GAIL with pytorch☆14Mar 11, 2020Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"☆15Aug 30, 2024Updated last year
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Aug 2, 2020Updated 5 years ago
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆13Oct 23, 2020Updated 5 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆26Oct 22, 2022Updated 3 years ago
- ☆18Aug 14, 2023Updated 2 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data☆11Sep 7, 2023Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 5 months ago
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- Generate Micro-Doppler signature of human motion by radar☆12Jul 2, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆48Nov 30, 2018Updated 7 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆42Dec 8, 2022Updated 3 years ago
- Implementation Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm in keras☆21Dec 19, 2023Updated 2 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is official code for ASFL.☆22Mar 3, 2025Updated last year
- ☆17Oct 25, 2023Updated 2 years ago
- ☆14May 30, 2019Updated 6 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated last month
- ☆19Nov 21, 2023Updated 2 years ago
- Using DDPG agent to control UAV system with energy efficiency☆16Jan 7, 2023Updated 3 years ago
- Repository for codes of 'Deep Reinforcement Learning'☆218Oct 4, 2019Updated 6 years ago