gebob19 / rl_with_jax
clear single-file JAX implementations of common RL algorithms
☆14 Updated 3 years ago
Related projects:
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆24 Updated last year
- ☆37 Updated last year
- V-MPO torch version with DMLab30 and GTrXL ☆12 Updated 3 years ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control. ☆16 Updated 3 years ago
- ☆18 Updated 7 months ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆24 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 Updated 3 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning ☆17 Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated last year
- Benchmarked implementations of Offline RL Algorithms. ☆62 Updated 4 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆33 Updated last week
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆24 Updated 2 years ago
- ☆16 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆96 Updated 2 years ago
- ☆33 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆59 Updated 2 months ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆47 Updated 2 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines ☆49 Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆16 Updated 8 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆76 Updated last year
- ☆41 Updated last year
- ☆51 Updated last year
- Advantage weighted Actor Critic for Offline RL ☆46 Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆49 Updated 8 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Simple maze environments using mujoco-py ☆52 Updated 8 months ago
- Bipedal Skills Benchmark for Reinforcement Learning ☆23 Updated last year
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆28 Updated 4 years ago