lasgroup / opax
☆15Updated last month
Related projects ⓘ
Alternatives and complementary repositories for opax
- ☆30Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆19Updated 3 years ago
- Experiment code for "Continuous-Time Model-Based Reinforcement Learning"☆47Updated 11 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆32Updated last year
- improved Cross Entropy Method for trajectory optimization☆69Updated 3 years ago
- ☆19Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- MBRL library in JAX☆9Updated 2 years ago
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆22Updated 2 weeks ago
- Formulating Model-based RL Dynamics as a continuous rather then one step prediction☆35Updated 2 years ago
- ☆33Updated 4 years ago
- ☆9Updated 4 years ago
- Companion code to "Learning Stable Deep Dynamics Models" (Manek and Kolter, 2019)☆32Updated 4 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- ☆42Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- ☆15Updated last year
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- An MPC algorithm which supports polytopic state and action constraints, using CEM optimisation.☆13Updated 5 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆12Updated 9 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆17Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- reinforcement learning from randomized simulations☆64Updated this week
- Conservative Q learning in Jax☆51Updated last year
- A toolbox for trajectory optimization of dynamical systems☆50Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago