CLAIRE-Labo / no-representation-no-trust
Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.
☆24Updated 2 months ago
Alternatives and similar repositories for no-representation-no-trust:
Users that are interested in no-representation-no-trust are comparing it to the libraries listed below
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 7 months ago
- ☆67Updated 5 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆19Updated 2 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆41Updated 2 months ago
- General Modules for JAX☆62Updated 6 months ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- ☆41Updated last year
- ☆29Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆94Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 5 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆76Updated 9 months ago
- Simple JAX Graphics Library.☆29Updated 2 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 9 months ago
- Conservative Q learning in Jax☆52Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 10 months ago
- ☆18Updated this week
- Corax: Core RL in JAX☆36Updated 11 months ago
- ☆46Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆135Updated 2 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆60Updated 7 months ago
- ☆42Updated 6 months ago
- POPGym Library in JAX☆11Updated 9 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Updated 5 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆60Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- ☆24Updated 7 months ago