Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024
☆27Apr 26, 2026Updated last month
Alternatives and similar repositories for self-predictive-rl
Users that are interested in self-predictive-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Artifact for paper "Chronosymbolic: Efficient CHC Solving with Symbolic Reasoning and Inductive Learning" in Python☆11Aug 4, 2024Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆23Nov 29, 2025Updated 6 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆73Apr 26, 2026Updated last month
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- ☆12Sep 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Jan 26, 2024Updated 2 years ago
- Code for recreating the results of our RSS 2020 paper, 'Learning Memory-Based Control for Human-Scale Bipedal Locomotion.'☆10Aug 18, 2022Updated 3 years ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated 2 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆30Mar 16, 2026Updated 2 months ago
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- Tools for manipulating CHC and related files☆15Apr 21, 2023Updated 3 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆30Jul 14, 2021Updated 4 years ago
- An extension of deeplab-v2 (in TF) allowing for smoothed dilated convolutions☆12Mar 27, 2019Updated 7 years ago
- The Laser Learning Environment (LLE) is a cooperative MARL grid-world☆13Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Auto Differentiate from scratch based on Autograd☆11Jun 21, 2022Updated 3 years ago
- Submission Under Review☆17May 15, 2025Updated last year
- Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)☆20Mar 18, 2024Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆29Jan 14, 2025Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- [ICML 2021] Learning Task Informed Abstractions -- a representation learning approach for model-based RL in complex visual domains☆18Jul 20, 2021Updated 4 years ago
- Thinker project☆16Sep 4, 2024Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆39Jan 16, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated 2 years ago
- Simulation of manufacturing systems☆15Mar 15, 2022Updated 4 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- ☆31Feb 20, 2021Updated 5 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 3 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21May 11, 2023Updated 3 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆36Apr 25, 2024Updated 2 years ago
- Creating an ATT&CK Navigator layer with the detection coverage of the signals available within Tanium Threat Response.☆11Jun 2, 2021Updated 5 years ago
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆32Jan 28, 2026Updated 4 months ago
- A too simple vsti synth made with wdl-ol☆18Mar 27, 2017Updated 9 years ago
- PWM: Policy Learning with Large World Models☆69Aug 4, 2025Updated 10 months ago