wisnunugroho21 / reinforcement_learning_v_mpoLinks
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆17Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning_v_mpo
Users that are interested in reinforcement_learning_v_mpo are comparing it to the libraries listed below
Sorting:
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated last month
- ☆22Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆40Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆32Updated 2 years ago
- DecentralizedLearning☆25Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆50Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 9 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- ☆25Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- ☆43Updated 2 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆88Updated last year
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆40Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated last month
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆59Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆81Updated 2 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Updated 2 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)☆48Updated 3 years ago
- RL Algorithms for Visual Continuous Control☆33Updated 2 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆15Updated 5 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆27Updated 4 years ago
- ☆36Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- ☆53Updated 7 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Updated 2 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆64Updated last year