wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆17Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning_v_mpo
Users that are interested in reinforcement_learning_v_mpo are comparing it to the libraries listed below
- ☆21Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- ☆18Updated 4 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆18Updated 2 years ago
- ☆24Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆71Updated 11 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆13Updated last year
- ☆22Updated last year
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆47Updated 4 years ago
- Model-based Policy Gradients☆31Updated 5 years ago
- code for polite☆11Updated last year
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆25Updated 2 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆21Updated last year
- PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer☆10Updated 11 months ago
- ☆18Updated 2 years ago
- DecentralizedLearning☆24Updated 2 years ago
- Evaluation of TD-MPC2.☆22Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 7 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆49Updated last year
- The implementation of Discriminator Soft Actor Critic☆15Updated 5 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆22Updated 7 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Updated 2 years ago
- Model-based reinforcement learning using CEM, MPC and PETS☆16Updated 5 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆50Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent☆13Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]