henrycharlesworth/multi_action_head_PPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/henrycharlesworth/multi_action_head_PPO)

henrycharlesworth / multi_action_head_PPO

PPO with multi-head/autoregressive action outputs

☆47

Alternatives and similar repositories for multi_action_head_PPO

Users that are interested in multi_action_head_PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cswinter / DeepCodeCraft
View on GitHub
Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.
☆21May 22, 2023Updated 3 years ago
thomashirtz / gym-hybrid
View on GitHub
Collection of OpenAI parametrized action-space environments.
☆70Mar 19, 2025Updated last year
chscheller / minerl_agent
View on GitHub
3rd placed submission to the NeurIPS MineRL competition 2019
☆10Mar 24, 2023Updated 3 years ago
CAI23sbP / Hybrid-Action-PPO
View on GitHub
Hybrid Action PPO in stable-baselines3
☆20Jan 14, 2025Updated last year
IouJenLiu / HTS-RL
View on GitHub
☆21Dec 22, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MarcoMeter / neroRL
View on GitHub
Deep Reinforcement Learning Framework done with PyTorch
☆43Mar 12, 2025Updated last year
HannesStark / gnn-reinforcement-learning
View on GitHub
Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.
☆35Apr 11, 2021Updated 5 years ago
thomasahle / liars-dice
View on GitHub
Liar's Dice AI in Pytorch: www.dudo.ai
☆36Mar 4, 2025Updated last year
YifanYang1995 / GOODRL
View on GitHub
[ICLR 2025] Graph Assisted Offline-Online Deep Reinforcement Learning (GOODRL) for Dynamic Workflow Scheduling (DWS)
☆28Updated this week
toshikwa / rltorch
View on GitHub
A simple framework for distributed reinforcement learning in PyTorch.
☆16Apr 24, 2020Updated 6 years ago
JamesUnicomb / ReinforcementLearning
View on GitHub
Various python scripts for reinforcement learning algorithms.
☆10Aug 7, 2018Updated 7 years ago
cycraig / gym-platform
View on GitHub
OpenAI Gym environment for Platform
☆22May 17, 2019Updated 7 years ago
nobrowning / SEQ_HGNN
View on GitHub
Seq-HGNN: Learning Sequential Node Representation on Heterogeneous Graph
☆12Aug 2, 2023Updated 2 years ago
facebookresearch / entity-factored-rl
View on GitHub
Source code for the paper "Policy Architectures for Compositional Generalization in Control"
☆30May 19, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zkxinxin / HCMGNN
View on GitHub
Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction
☆12Aug 19, 2024Updated last year
chiamp / muzero-cartpole
View on GitHub
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆22May 6, 2023Updated 3 years ago
yuchen-x / MacroMARL
View on GitHub
☆26Apr 16, 2024Updated 2 years ago
rlew631 / AutonomousVehicleSimulation
View on GitHub
Used Flow, Ray/RLlib and OpenAI Gym to simulate and train autonomous vehicles/human drivers in SUMO (Simulation of Urban Mobility)
☆25Dec 15, 2020Updated 5 years ago
airjerry1216 / VLSI-Physical-Design-Automation
View on GitHub
NTHU CS6135 VLSI實體設計自動化
☆11Mar 12, 2022Updated 4 years ago
Dingjeson / Benchmark-instances-for-DJSP
View on GitHub
The repository contains 15 benchmarks for dynamic job shop scheduling problem.
☆13Apr 30, 2020Updated 6 years ago
ssokota / mmd
View on GitHub
Code for magnetic mirror descent.
☆20Oct 5, 2023Updated 2 years ago
vwxyzjn / ppo-implementation-details
View on GitHub
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
☆943Mar 23, 2024Updated 2 years ago
junkwhinger / PPO_PyTorch
View on GitHub
This repo contains PPO implementation in PyTorch for LunarLander-v2
☆11Jun 26, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shanest / decisions-games-ai
View on GitHub
Lecture notes for a course on Decision and Game Theory for undergraduates studying AI
☆13Dec 14, 2018Updated 7 years ago
mgwoo / RePlAce
View on GitHub
☆11Mar 14, 2022Updated 4 years ago
qiongwu86 / Federated-SSL-task-offloading-and-resource-allocation
View on GitHub
☆46Jul 24, 2024Updated last year
shah314 / samultichoiceknapsack
View on GitHub
Simulated Annealing for the Multiple Choice Multidimensional Knapsack Problem
☆16May 16, 2020Updated 6 years ago
FredericoMetelo / TaskOffloadingAgentLibrary
View on GitHub
☆10Jul 26, 2024Updated last year
XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
T3AS / MAD-ARL
View on GitHub
Python project for the paper "Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Poli…
☆13Feb 24, 2023Updated 3 years ago
compsciencelab / ppo_D
View on GitHub
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…
☆19Oct 5, 2021Updated 4 years ago
ChristosKap / policy_consolidation
View on GitHub
Code for Policy Consolidation for Continual Reinforcement Learning
☆10May 12, 2019Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
galdl / rl_delay_basic
View on GitHub
Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.
☆14Sep 12, 2023Updated 2 years ago
davide97l / PPO-GAIL-cartpole
View on GitHub
GAIL learning to imitate PPO playing CartPole.
☆13May 27, 2021Updated 5 years ago
SimRey / HPPO
View on GitHub
☆11Sep 13, 2025Updated 10 months ago
oowekyala / a-maze-in-python
View on GitHub
Maze generation & solving with Python
☆10Oct 2, 2021Updated 4 years ago
rblaughol / Vehicular-T-Pattern-Tree
View on GitHub
Yan, Ruibin, Yijun Gu, Zeyu Zhang, and Shouzhong Jiao. 2023. "Vehicle Trajectory Prediction Method for Task Offloading in Vehicular Edge …
☆14Sep 27, 2023Updated 2 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
martinkubecka / C2Detective
View on GitHub
Application for detecting command and control (C2) communication through network traffic analysis.
☆17May 12, 2023Updated 3 years ago