philipjball / TD3_PyTorch
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆9Updated 3 years ago
Alternatives and similar repositories for TD3_PyTorch:
Users that are interested in TD3_PyTorch are comparing it to the libraries listed below
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated 2 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆42Updated 5 months ago
- ☆11Updated 2 years ago
- Deep Learning (FS 2020)☆16Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆64Updated 5 months ago
- ☆47Updated 3 years ago
- ☆12Updated 4 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- ☆17Updated 2 years ago
- ☆26Updated 2 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆20Updated 2 years ago
- ☆91Updated 4 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- ☆54Updated 2 years ago
- Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning☆15Updated 2 years ago
- ☆42Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆51Updated last year
- DecentralizedLearning☆23Updated 2 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆42Updated 5 years ago
- There will be updates later☆84Updated 5 years ago
- ☆28Updated 3 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆57Updated 4 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆54Updated 8 months ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆26Updated 3 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆58Updated 3 years ago
- Distributional Soft Actor Critic☆50Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆69Updated last year
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆17Updated 2 years ago