CLAIRE-Labo / no-representation-no-trustLinks

Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.

☆29

Alternatives and similar repositories for no-representation-no-trust

Users that are interested in no-representation-no-trust are comparing it to the libraries listed below

Sorting:

luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆110Updated last year
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57Updated last year
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆120Updated last month
Div-Infinity / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87Updated 2 years ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆102Updated last month
ahmed-touati / controllable_agent
☆52Updated 2 years ago
mklissa / dceo
Learning diverse options through the Laplacian representation.
☆23Updated last year
twitter-research / hyperbolic-rl
☆57Updated 3 years ago
ethanluoyc / corax
Corax: Core RL in JAX
☆38Updated last year
DAVIAN-Robotics / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆77Updated 2 weeks ago
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆60Updated last year
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆84Updated last year
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆117Updated last year
cassidylaidlaw / effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆50Updated last year
vivekmyers / contrastive_planning
Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"
☆43Updated last year
micahcarroll / uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆57Updated last year
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆121Updated 4 months ago
SonyResearch / simba
☆110Updated 8 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆104Updated last year
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆21Updated 10 months ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23Updated last year
ucl-dark / skillhack
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17Updated 3 years ago
sail-sg / rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30Updated 2 years ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆212Updated last week
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆160Updated 10 months ago
danijar / elements
Building blocks for productive research
☆63Updated 3 months ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆79Updated 2 years ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆77Updated last year
ElisevanderPol / symmetrizer
☆32Updated 4 years ago