CLAIRE-Labo / no-representation-no-trust
Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.
☆15Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for no-representation-no-trust
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆33Updated 2 weeks ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆89Updated this week
- Evaluating long-term memory of reinforcement learning algorithms☆132Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆87Updated 11 months ago
- ☆87Updated this week
- Learning diverse options through the Laplacian representation.☆22Updated 10 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆72Updated 6 months ago
- ☆62Updated 2 months ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆44Updated 6 months ago
- Conservative Q learning in Jax☆50Updated last year
- ☆147Updated 2 months ago
- ☆64Updated last week
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 11 months ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆126Updated 2 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆85Updated 11 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆53Updated 5 months ago
- ☆36Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 4 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 3 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆52Updated 7 months ago
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆12Updated last week
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated last year
- Deep Hierarchical Planning from Pixels☆90Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆57Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆21Updated 6 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆94Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago
- ☆28Updated 3 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆203Updated 3 weeks ago