philtabor / Advanced-Replay-Strategies
☆13Updated 2 years ago
Alternatives and similar repositories for Advanced-Replay-Strategies:
Users that are interested in Advanced-Replay-Strategies are comparing it to the libraries listed below
- PyTorch implementation of discrete version of Soft Actor-Critic.☆33Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- ☆40Updated 3 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆35Updated 4 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- There will be updates later☆84Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimization☆53Updated 3 years ago
- ☆39Updated 2 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆148Updated last year
- ☆20Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- ☆59Updated 4 years ago
- Code for Weighted QMIX☆135Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 4 years ago
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- ☆21Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆74Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Updated last year
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- Transformer in RL for decision-making☆98Updated 2 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆72Updated 5 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- ☆41Updated 5 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago