wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for reinforcement_learning_v_mpo
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆29Updated 2 years ago
- ☆21Updated 7 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆24Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- code for polite☆11Updated 8 months ago
- ☆22Updated 7 months ago
- ☆20Updated 6 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆11Updated last year
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆24Updated 2 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆13Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 6 months ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆17Updated last year
- curriculum☆20Updated last year
- ☆21Updated 2 years ago
- Evaluation of TD-MPC2.☆22Updated 10 months ago
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks.☆25Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆32Updated last year
- ☆38Updated last year
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆31Updated 4 years ago
- ☆22Updated 5 months ago
- [IROS 22'] Model-free Neural Lyapunov Control☆20Updated last year
- ☆14Updated 8 months ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- ☆23Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆34Updated last year
- Implementation of Sim2Seg (John So*, Amber Xie*, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali-akbar Agha-mohammad, Pieter Abbeel, Ste…☆31Updated last year
- Model-based Policy Gradients☆30Updated 4 years ago