herolab-uga / bsac
Bayesian Soft Actor Critic
☆14Updated 2 years ago
Alternatives and similar repositories for bsac:
Users that are interested in bsac are comparing it to the libraries listed below
- A clean and robust Pytorch implementation of SAC on discrete action space☆35Updated 5 months ago
- ☆40Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆21Updated 4 months ago
- Communication using GNN in MARL☆20Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆43Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- qmix☆22Updated 4 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆22Updated 2 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆24Updated 2 years ago
- ☆15Updated 5 years ago
- BranchingDQN☆49Updated 6 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆33Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated last month
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- ☆96Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆101Updated 2 years ago
- ☆21Updated last year
- ☆39Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆148Updated last year
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- ☆44Updated 4 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆10Updated last year
- ☆59Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- Implementation of DyMA-CL, MARL algorithm☆26Updated 5 years ago
- ☆42Updated 3 years ago