GRAAL-Research / OfflineRLReadingGroup
Offline Reinforcement Learning Reading Group
☆25 Updated 2 years ago
Alternatives and similar repositories for OfflineRLReadingGroup:
Users who are interested in OfflineRLReadingGroup are comparing it to the libraries listed below.
- Model-Based Offline Reinforcement Learning ☆48 Updated 4 years ago
- Simple maze environments using mujoco-py ☆54 Updated last year
- ☆54 Updated 10 months ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆26 Updated 2 years ago
- ☆53 Updated last year
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated 5 years ago
- ☆41 Updated 3 years ago
- ☆46 Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆60 Updated last year
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 Updated 4 years ago
- ☆110 Updated last year
- ☆47 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆102 Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆173 Updated 2 years ago
- Code for demonstration example-task in RUDDER blog ☆22 Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆24 Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆146 Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆106 Updated 2 years ago
- Code for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆51 Updated 4 years ago
- NeurIPS Reproducibility Challenge 2019 ☆20 Updated 4 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆73 Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆51 Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆85 Updated 3 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆33 Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆26 Updated last year
- On-policy optimization baselines for deep reinforcement learning ☆28 Updated 4 years ago
- ☆29 Updated 2 years ago
- Distributional Soft Actor Critic ☆50 Updated 4 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆159 Updated 2 years ago
- PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch. ☆31 Updated 4 years ago