zuzuba / CISR_NeurIPS20
☆18 Updated 4 years ago
Alternatives and similar repositories for CISR_NeurIPS20:
Users who are interested in CISR_NeurIPS20 are comparing it to the repositories listed below:
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆45 Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆34 Updated 2 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 Updated 3 years ago
- Implementations of SAILR, PDO, and CSC ☆32 Updated 9 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆37 Updated 2 years ago
- ☆21 Updated 11 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 Updated 5 months ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆66 Updated 2 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆43 Updated 6 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State. ☆20 Updated 3 years ago
- ☆18 Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆23 Updated last year
- Model-Based Uncertainty in Value Functions (AISTATS 2023) ☆18 Updated 2 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo… ☆12 Updated 6 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training" ☆17 Updated 2 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 Updated 4 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 Updated 4 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020) ☆40 Updated 2 years ago
- Code for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆53 Updated 4 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning ☆24 Updated last year
- Safe Reinforcement Learning with Natural Language Constraints ☆15 Updated 3 years ago
- Implementation of CoDAIL in the ICLR 2020 paper "Multi-Agent Interactions Modeling with Correlated Policies" ☆18 Updated 3 years ago
- On-policy optimization baselines for deep reinforcement learning ☆30 Updated 5 years ago
- ☆23 Updated 11 months ago
- DecentralizedLearning ☆24 Updated 2 years ago
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023 ☆28 Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆62 Updated last year