herolab-uga / bsac
Bayesian Soft Actor Critic
☆14Updated 2 years ago
Alternatives and similar repositories for bsac
Users who are interested in bsac are comparing it to the repositories listed below
- A clean and robust Pytorch implementation of SAC on discrete action space☆41Updated last year
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆94Updated 4 years ago
- Code snippets of Meta Reinforcement Learning algorithms☆39Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆161Updated 2 years ago
- ☆40Updated 3 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆53Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆63Updated 4 years ago
- ☆104Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆54Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Updated 4 years ago
- PyTorch implementation of MATD3☆13Updated 5 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆27Updated 2 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆221Updated 6 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆165Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆87Updated 4 years ago
- qmix☆23Updated 5 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆143Updated last year
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆53Updated 3 years ago
- An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks.☆135Updated 11 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆88Updated 6 months ago
- ☆89Updated 4 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 4 years ago
- The code for maddpg using pytorch☆170Updated 5 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆91Updated 11 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 7 months ago
- BranchingDQN☆50Updated 6 years ago
- ☆60Updated 5 years ago
- Study of paper "Meta reinforcement learning for sim-to-real domain adaptation"☆19Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆195Updated last year
- This is the official implementation of Multi-Agent PPO.☆120Updated 2 years ago