herolab-uga / bsac
Bayesian Soft Actor Critic
☆15Updated 2 years ago
Alternatives and similar repositories for bsac:
Users that are interested in bsac are comparing it to the libraries listed below
- A clean and robust Pytorch implementation of SAC on discrete action space☆34Updated 3 months ago
- ☆40Updated 3 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆41Updated 2 years ago
- ☆94Updated 3 years ago
- ☆40Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆79Updated 4 years ago
- Hybrid action space reinforcement learning algorithms.☆12Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 3 years ago
- Communication using GNN in MARL☆18Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆53Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆31Updated 3 years ago
- Implementation for mSAC methods in PyTorch☆40Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆69Updated 2 months ago
- BranchingDQN☆49Updated 6 years ago
- QMIX implemented in TensorFlow 2☆17Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated 4 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆19Updated 2 months ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆23Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆140Updated last year
- ☆38Updated 2 years ago
- my code for paper Parameterized-DQN☆21Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆118Updated 10 months ago
- ☆20Updated last year
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…☆46Updated 10 months ago
- ☆58Updated 4 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆40Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- qmix☆22Updated 4 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆26Updated last year
- Code for Weighted QMIX☆129Updated 4 years ago