zzzxxxttt / pytorch_simple_RL
Simple pytorch implmentation of reinforcement learning algorithms
☆25Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pytorch_simple_RL
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆30Updated 3 years ago
- A PyTorch implementation of SSINet.☆16Updated 4 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 4 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆112Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- [ICLR 2022] Official implementation of paper: Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization☆42Updated last year
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- ☆5Updated last year
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆66Updated last year
- ☆17Updated 4 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆32Updated 4 years ago
- This is MPE-pytorch, fix some bugs.☆10Updated 4 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆46Updated 5 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆56Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆116Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆129Updated 3 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 3 years ago
- ☆10Updated 3 years ago
- Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20☆73Updated last year
- ☆119Updated last year
- Semantic Predictive Control☆27Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- ☆83Updated 5 years ago
- personal paper reading on neural motion planner and controller☆24Updated 4 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago