philtabor / Advanced-Replay-Strategies
☆13 Updated 2 years ago
Alternatives and similar repositories for Advanced-Replay-Strategies
Users who are interested in Advanced-Replay-Strategies are comparing it to the libraries listed below.
- ☆40 Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 2 months ago
- Deep recurrent Q learning on CartPole-v1 environment ☆89 Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆51 Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆34 Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆48 Updated 5 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D… ☆92 Updated 4 years ago
- ☆59 Updated 4 years ago
- ☆39 Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆65 Updated 8 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆64 Updated 5 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where… ☆27 Updated 5 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆58 Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimization ☆54 Updated 3 years ago
- An implementation of ATOC ☆14 Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆84 Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch ☆45 Updated 2 years ago
- ☆44 Updated 4 years ago
- ☆41 Updated 5 years ago
- ☆96 Updated 3 years ago
- Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., NeurIPS 2024). ☆19 Updated 3 months ago
- ☆49 Updated 3 years ago
- Revisiting Discrete Gradient Estimation in MADDPG ☆24 Updated 2 years ago
- Distributional Soft Actor Critic ☆53 Updated 4 years ago
- PyTorch implementation of DDPG for continuous control tasks. ☆46 Updated 5 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆46 Updated 8 months ago
- Implementation of DyMA-CL, MARL algorithm ☆27 Updated 5 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles ☆14 Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆102 Updated 2 years ago
- RLToolkit is a flexible and highly efficient reinforcement learning framework. Includes implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG,… ☆17 Updated last year