twni2016 / self-predictive-rl
Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024
☆13Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for self-predictive-rl
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆55Updated 10 months ago
- ☆34Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆19Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆17Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 4 months ago
- ☆38Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆32Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 7 months ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆18Updated 5 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- Conservative Q learning in Jax☆51Updated last year
- ☆15Updated 7 months ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago
- ☆22Updated 9 months ago
- An Open-Ended Agentic Simulator☆28Updated 3 months ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- ☆40Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆17Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- ☆18Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆40Updated last week
- ☆21Updated 7 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆21Updated 7 months ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆21Updated last year