wisnunugroho21 / reinforcement_learning_v_mpoLinks
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆17Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_v_mpo
Users that are interested in reinforcement_learning_v_mpo are comparing it to the libraries listed below
Sorting:
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 3 months ago
- ☆25Updated last year
- ☆23Updated last year
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- Model-based Policy Gradients☆32Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Updated 2 years ago
- DecentralizedLearning☆25Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆59Updated 2 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)☆49Updated 3 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆13Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 3 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 11 months ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆89Updated last year
- ☆23Updated last year
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆30Updated 2 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 3 years ago
- ☆35Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Hierarchical Reinforcement Learning (batteries included)☆47Updated 6 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆24Updated 3 years ago