herolab-uga / bsac
Bayesian Soft Actor Critic
☆15Updated 2 years ago
Alternatives and similar repositories for bsac
Users that are interested in bsac are comparing it to the libraries listed below
- A clean and robust PyTorch implementation of SAC on discrete action space☆41Updated last year
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆57Updated 3 years ago
- ☆40Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆163Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆91Updated 5 years ago
- ☆106Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Updated 4 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆223Updated 6 years ago
- PyTorch implementation of MATD3☆13Updated 5 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆64Updated 4 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆96Updated 4 years ago
- A clean and robust PyTorch implementation of SAC on continuous action space☆90Updated 7 months ago
- Code snippets of Meta Reinforcement Learning algorithms☆39Updated 2 years ago
- ☆62Updated 5 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆97Updated last year
- my code for paper Parameterized-DQN☆25Updated 4 years ago
- qmix☆23Updated 5 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆55Updated 3 years ago
- ☆46Updated 3 years ago
- ☆20Updated 2 years ago
- BranchingDQN☆50Updated 6 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago
- ☆220Updated 2 years ago
- An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks.☆139Updated last year
- A clean and robust PyTorch implementation of PPO on continuous action space.☆168Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆151Updated last year
- This is the official implementation of Multi-Agent PPO.☆125Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic, accepted by Transactions on Machine Learning Research (TMLR)☆26Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆207Updated last year
- Communication using GNN in MARL☆31Updated 3 years ago