herolab-uga / bsac
Bayesian Soft Actor Critic
☆14Updated 2 years ago
Alternatives and similar repositories for bsac
Users that are interested in bsac are comparing it to the libraries listed below
Sorting:
- A clean and robust Pytorch implementation of SAC on discrete action space☆36Updated 6 months ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Updated 2 years ago
- ☆40Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago
- ☆96Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆20Updated 5 months ago
- ☆21Updated last year
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆46Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 2 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆22Updated 2 years ago
- Communication using GNN in MARL☆20Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- qmix☆22Updated 4 years ago
- BranchingDQN☆50Updated 6 years ago
- QMIX implemented in TensorFlow 2☆17Updated 3 years ago
- ☆15Updated 5 years ago
- my code for paper Parameterized-DQN☆22Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- ☆38Updated 3 years ago
- Implementation of DyMA-CL, MARL algorithm☆27Updated 5 years ago
- ☆39Updated 2 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆149Updated last year
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆14Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆84Updated 4 years ago
- ☆85Updated 3 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆36Updated 4 years ago