This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆22Nov 29, 2025Updated 3 months ago
Alternatives and similar repositories for ais
Users that are interested in ais are comparing it to the libraries listed below
Sorting:
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆24Apr 7, 2024Updated last year
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated last year
- Everything needed to replicate the figures from the paper "Engineering recurrent neural networks from task-relevant manifolds and dynamic…☆12Dec 5, 2019Updated 6 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- ☆15Jan 30, 2021Updated 5 years ago
- Course notes for ECSE 506: Stochastic Control and Decision Theory☆39Updated this week
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Jan 16, 2023Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆19Jan 9, 2025Updated last year
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆24May 11, 2024Updated last year
- [ICML 2021] Learning Task Informed Abstractions -- a representation learning approach for model-based RL in complex visual domains☆18Jul 20, 2021Updated 4 years ago
- Some python functions to supplement the NEURON python module☆29Jun 25, 2019Updated 6 years ago
- A PyTorch implementation of MPC as a Function Approximator☆19Sep 27, 2021Updated 4 years ago
- ☆25Aug 4, 2023Updated 2 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆20Jan 4, 2026Updated last month
- Factored model-based Bayesian Reinforcement Learning framework☆22Nov 23, 2022Updated 3 years ago
- Difference-of-Entropies (DoE) Estimator☆26Apr 13, 2022Updated 3 years ago
- ☆23Aug 19, 2022Updated 3 years ago
- SATA: Safe and Adaptive Torque-Based Locomotion Policies Inspired by Animal Learning☆39May 19, 2025Updated 9 months ago
- Recurrent state-space models for decision making☆30Oct 25, 2022Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆342Aug 22, 2024Updated last year
- ☆26Jun 19, 2020Updated 5 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Jan 18, 2024Updated 2 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆30Dec 28, 2017Updated 8 years ago
- Code needed to reproduce the examples found in "Learning Control Barrier Functions from Expert Demonstrations," by A. Robey, H. Hu, L. Li…☆73Aug 13, 2023Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- implementation of our self-guided and self-regularized actor-critic algorithm☆30Jan 1, 2023Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Enhanced Explainable Neural Network☆10Dec 25, 2021Updated 4 years ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- [L4DC 2025] Official code repository for "Diffusion Predictive Control with Constraints".☆52Jun 20, 2025Updated 8 months ago
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆95Apr 8, 2024Updated last year
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see REAMDE.…☆10Mar 7, 2024Updated last year
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network☆15Jun 11, 2024Updated last year
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago