henrycharlesworth / multi_action_head_PPO
PPO with multi-head/autoregressive action outputs
☆45 · Mar 4, 2021 · Updated 4 years ago
Alternatives and similar repositories for multi_action_head_PPO
Users who are interested in multi_action_head_PPO are comparing it to the libraries listed below.
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆21 · May 22, 2023 · Updated 2 years ago
- Collection of OpenAI parametrized action-space environments. ☆69 · Mar 19, 2025 · Updated 10 months ago
- ☆21 · Dec 22, 2020 · Updated 5 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 2 years ago
- Hybrid Action PPO in stable-baselines3 ☆17 · Jan 14, 2025 · Updated last year
- Lecture notes for a course on Decision and Game Theory for undergraduates studying AI ☆13 · Dec 14, 2018 · Updated 7 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO) ☆16 · Oct 23, 2021 · Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL ☆13 · Mar 1, 2021 · Updated 4 years ago
- GAIL learning to imitate PPO playing CartPole. ☆12 · May 27, 2021 · Updated 4 years ago
- My Submission for the OpenAI/NeurIPS ProcGen Competition ☆11 · Nov 12, 2020 · Updated 5 years ago
- Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments. ☆35 · Apr 11, 2021 · Updated 4 years ago
- Study notes on the algorithm engineer's tech stack ☆15 · Aug 22, 2022 · Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆43 · Mar 12, 2025 · Updated 11 months ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- Gym environment for playing Wordle with RL agents ☆42 · Feb 8, 2022 · Updated 4 years ago
- A simple framework for distributed reinforcement learning in PyTorch. ☆16 · Apr 24, 2020 · Updated 5 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym ☆21 · May 6, 2023 · Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works ☆18 · Mar 2, 2021 · Updated 4 years ago
- CFR implementation of a poker bot. ☆12 · Feb 17, 2023 · Updated 3 years ago
- Code for magnetic mirror descent. ☆17 · Oct 5, 2023 · Updated 2 years ago
- ☆53 · Apr 11, 2023 · Updated 2 years ago
- ☆24 · Nov 1, 2022 · Updated 3 years ago
- ☆25 · Apr 16, 2024 · Updated last year
- OpenAI Gym environment for Platform ☆21 · May 17, 2019 · Updated 6 years ago
- Various adaptive control implementations (university project) ☆22 · Jun 22, 2017 · Updated 8 years ago
- Used Flow, Ray/RLlib and OpenAI Gym to simulate and train autonomous vehicles/human drivers in SUMO (Simulation of Urban Mobility) ☆24 · Dec 15, 2020 · Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆54 · Aug 30, 2024 · Updated last year
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment ☆20 · Dec 2, 2025 · Updated 2 months ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control" ☆30 · May 19, 2022 · Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- Implementations of a large collection of reinforcement learning algorithms. ☆28 · Nov 30, 2023 · Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆28 · Mar 24, 2023 · Updated 2 years ago
- A Reinforcement Learning Project using PPO + Transformer ☆86 · Jul 21, 2023 · Updated 2 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms ☆13 · Dec 15, 2022 · Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat… ☆13 · Nov 27, 2022 · Updated 3 years ago
- A Doom Source Port based on SDL2 ☆29 · Jan 30, 2024 · Updated 2 years ago
- DAGGEN: A synthetic task graph generator ☆78 · Jun 22, 2022 · Updated 3 years ago
- distributed RL spaghetti al arabiata ☆32 · Mar 29, 2019 · Updated 6 years ago
- Counterfactual Regret Minimization for a simplified version of Texas Hold'em poker ☆28 · Oct 29, 2020 · Updated 5 years ago