deligentfool/policy_based_RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/deligentfool/policy_based_RL)

deligentfool / policy_based_RL

The implement of the policy gradient RL algorithm with pytorch

☆41

Alternatives and similar repositories for policy_based_RL

Users that are interested in policy_based_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chagmgang / pytorch_ppo_rl
View on GitHub
Pytorch implementation of intrinsic curiosity module with proximal policy optimization
☆55Dec 20, 2018Updated 7 years ago
simsimiSION / pymarl-algorithm-extension-via-starcraft
View on GitHub
☆13Aug 15, 2020Updated 5 years ago
Ullar-Kask / TD3-PER
View on GitHub
An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer
☆25Aug 14, 2019Updated 6 years ago
deligentfool / dqn_zoo
View on GitHub
The implement of all kinds of dqn reinforcement learning with Pytorch
☆97Mar 25, 2021Updated 5 years ago
deligentfool / GAIL_pytorch
View on GitHub
The implement of GAIL with pytorch
☆14Mar 11, 2020Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
adik993 / ppo-pytorch
View on GitHub
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆148Jan 12, 2019Updated 7 years ago
kngwyu / Rainy
View on GitHub
Deep RL agents with PyTorch
☆35Sep 25, 2021Updated 4 years ago
LukasSchaefer / MSc_Curiosity_MARL
View on GitHub
MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning
☆13Aug 16, 2019Updated 6 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
nicklashansen / a3c
View on GitHub
Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)
☆10Oct 11, 2019Updated 6 years ago
deligentfool / maddpg
View on GitHub
Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch
☆10Aug 2, 2020Updated 5 years ago
binz98 / Multi_Agent_Stackelberg_Decision_Transformer
View on GitHub
Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"
☆15Aug 30, 2024Updated last year
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
PhDChe / Poker-1
View on GitHub
Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping
☆13Oct 23, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / strategically_efficient_rl
View on GitHub
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21Jul 30, 2024Updated last year
deligentfool / HAVEN
View on GitHub
Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"
☆27Oct 22, 2022Updated 3 years ago
benellis3 / mappo
View on GitHub
☆18Aug 14, 2023Updated 2 years ago
xiangqianL / ESPerHFL
View on GitHub
Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data
☆11Sep 7, 2023Updated 2 years ago
wisnunugroho21 / reinforcement_learning_ppo_rnd
View on GitHub
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆57Nov 10, 2025Updated 8 months ago
cqian19 / qmix-plus
View on GitHub
Improving upon state of the art cooperative deep reinforcement learning in StarCraft II
☆13May 16, 2019Updated 7 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
duckzhao / air_campaign_rl
View on GitHub
基于强化学习的游戏空战推演
☆13May 8, 2021Updated 5 years ago
PeixiLiu / humanMotionRadar
View on GitHub
Generate Micro-Doppler signature of human motion by radar
☆11Jul 2, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lafmdp / HIDIL
View on GitHub
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12Nov 24, 2021Updated 4 years ago
BUPT-ANTlab / PEPCRL-MVP
View on GitHub
☆17Oct 25, 2023Updated 2 years ago
orrivlin / MountainCar_DQN_RND
View on GitHub
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
☆41Jan 28, 2019Updated 7 years ago
navid-naderi / GraphMIX
View on GitHub
Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning
☆36Feb 13, 2021Updated 5 years ago
jcwleo / mario_rl
View on GitHub
☆69Nov 30, 2018Updated 7 years ago
younggyoseo / pytorch-nfsp
View on GitHub
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆48Nov 30, 2018Updated 7 years ago
Ja1r0 / DQN-play-SuperMario
View on GitHub
implement the classic reinforcement learning algorithm DQN to play supermariobrother
☆15Dec 18, 2017Updated 8 years ago
hahayonghuming / VDACs
View on GitHub
Value-Decomposition Multi-Agent Actor-Critics
☆42Dec 8, 2022Updated 3 years ago
gioramponi / sigma-girl-MIIRL
View on GitHub
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13May 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pr-shukla / maddpg-keras
View on GitHub
Implementation Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm in keras
☆21Dec 19, 2023Updated 2 years ago
WUR-ABE / rl_drone_object_search
View on GitHub
UAV-based path planning for efficient localization of non-uniformly distributed weeds using prior knowledge: A reinforcement-learning app…
☆15Jul 1, 2025Updated last year
yaoliucs / PQL
View on GitHub
Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"
☆11Oct 22, 2020Updated 5 years ago
brenda-Zheng / Exponential-Predefined-Time-Trajectory-Tracking-Control
View on GitHub
☆20Nov 21, 2023Updated 2 years ago
erichson / JumpReLU
View on GitHub
Jump ReLU
☆12Apr 8, 2019Updated 7 years ago
ducmngx / DDPG-UAV-Efficiency
View on GitHub
Using DDPG agent to control UAV system with energy efficiency
☆16Jan 7, 2023Updated 3 years ago
Kyushik / DRL
View on GitHub
Repository for codes of 'Deep Reinforcement Learning'
☆218Oct 4, 2019Updated 6 years ago