wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16Updated 2 years ago
Related projects: ⓘ
- ☆20Updated 5 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆24Updated last year
- Evaluation of TD-MPC2.☆22Updated 7 months ago
- ☆20Updated 4 months ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆29Updated 2 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆23Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆59Updated 2 months ago
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning☆11Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆13Updated last year
- ☆21Updated 2 years ago
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆18Updated last year
- EARL: Environment for Autonomous Reinforcement Learning☆33Updated last year
- ☆18Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆10Updated last year
- Implementation of Sim2Seg (John So*, Amber Xie*, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali-akbar Agha-mohammad, Pieter Abbeel, Ste…☆30Updated last year
- ☆33Updated last year
- ☆21Updated 5 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago
- ☆13Updated 2 years ago
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks.☆25Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆15Updated last year
- Simulation system for path planning evaluation☆10Updated 10 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆21Updated 5 months ago
- ☆20Updated 2 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Updated last year
- ☆23Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆37Updated 3 years ago
- [IROS 22'] Model-free Neural Lyapunov Control☆19Updated last year