info-structures / ais
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆19Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for ais
- ☆30Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆17Updated last year
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆42Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago
- Factored model-based Bayesian Reinforcement Learning framework☆20Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆32Updated last year
- ☆21Updated 7 months ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆39Updated 2 years ago
- Working directory for dynamics learning for experimental robots.☆56Updated 3 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…☆12Updated 6 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- Gym-like extensions for POMDP☆56Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆25Updated 4 years ago
- ☆34Updated last year
- ☆18Updated 2 years ago
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21☆23Updated 3 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- Implementations of SAILR, PDO, and CSC☆31Updated 4 months ago
- Proximal Policy Option-Critic☆21Updated 5 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆15Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆32Updated 2 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024☆13Updated 7 months ago