GhadaSokar / Dynamic-Sparse-Training-for-Deep-Reinforcement-Learning
[IJCAI 2022] "Dynamic Sparse Training for Deep Reinforcement Learning" by Ghada Sokar, Elena Mocanu, Decebal Constantin Mocanu, Mykola Pechenizkiy, and Peter Stone.
☆13 Updated 2 years ago
Alternatives and similar repositories for Dynamic-Sparse-Training-for-Deep-Reinforcement-Learning:
Users who are interested in Dynamic-Sparse-Training-for-Deep-Reinforcement-Learning are comparing it to the libraries listed below.
- PyTorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for discrete Gym environments ☆27 Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation also includes the extensions Munchausen RL and D2R… ☆22 Updated 3 years ago
- Experiments in training a transformer network to master reinforcement learning environments. ☆32 Updated 4 years ago
- Constrained Exploration and Recovery from Experience Shaping ☆22 Updated 5 years ago
- Implementation of Robust Adversarial Reinforcement Learning ☆13 Updated 7 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆38 Updated 4 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆26 Updated 5 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 Updated 5 years ago
- Evolution-based Soft Actor-Critic (ESAC) ☆41 Updated 8 months ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆43 Updated 5 years ago
- DecentralizedLearning ☆24 Updated 2 years ago
- On-policy optimization baselines for deep reinforcement learning ☆30 Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated 2 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO) ☆17 Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO) ☆21 Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 5 years ago
- Prioritized Sequence Experience Replay ☆10 Updated 3 years ago
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training" ☆17 Updated 2 years ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork ☆13 Updated 11 months ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆66 Updated 2 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- Mirror Descent Policy Optimization ☆38 Updated 4 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL ☆41 Updated 6 months ago
- Sample-Efficient Automated Deep Reinforcement Learning ☆34 Updated 4 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 Updated 3 years ago