sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆36Updated 5 years ago
Alternatives and similar repositories for td-reg:
Users that are interested in td-reg are comparing it to the libraries listed below
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- Distributed DDPG implementation in pytorch☆9Updated 6 years ago
- ☆36Updated 8 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- ☆70Updated 5 years ago
- ☆54Updated 7 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- ☆98Updated 2 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- ☆35Updated 6 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- ☆11Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Updated 6 years ago
- ☆72Updated 4 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆23Updated 10 months ago
- Experiments on a discrete mean field game model of population dynamics with reinforcement learning☆34Updated last year