twni2016 / self-predictive-rl
Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024
☆18Updated 11 months ago
Alternatives and similar repositories for self-predictive-rl:
Users that are interested in self-predictive-rl are comparing it to the libraries listed below
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆20Updated 3 years ago
- ☆35Updated 2 years ago
- ☆16Updated 11 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 5 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆32Updated 2 years ago
- ☆23Updated last year
- ☆32Updated 7 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆12Updated 8 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 11 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- ☆23Updated 11 months ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆18Updated 4 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 11 months ago
- ☆18Updated 2 years ago
- ☆28Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- ☆17Updated 3 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆19Updated 6 months ago
- ☆36Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago
- ☆16Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago