swan-utokyo / deir
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
☆18 Updated 4 months ago
Related projects:
- ☆56 Updated 3 weeks ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments" ☆27 Updated last week
- Evaluating long-term memory of reinforcement learning algorithms ☆129 Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆76 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆49 Updated 11 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆102 Updated 3 weeks ago
- Reproduction of DreamerV1 and V2 in PyTorch for the DeepMind Control Suite ☆25 Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆77 Updated 2 years ago
- General Modules for JAX ☆57 Updated last month
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong … ☆10 Updated last month
- PyTorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation ☆17 Updated 2 years ago
- PyTorch version of Dreamer, which follows the original TF v2 code. ☆112 Updated 2 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023) https://arxiv.org/abs/2208.10291. ☆90 Updated last year
- Object Centric Atari games ☆43 Updated this week
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 Updated 7 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 Updated 2 years ago
- ☆41 Updated 3 years ago
- ☆11 Updated this week
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆55 Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆51 Updated 5 months ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS ☆20 Updated 11 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆96 Updated 2 years ago
- ☆34 Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 Updated 9 months ago
- ☆20 Updated last year
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- Accelerated replay buffers in JAX ☆39 Updated 2 years ago
- Reinforcement Learning via Supervised Learning ☆67 Updated 2 years ago
- Baselines for gymnax 🤖 ☆57 Updated last year
- My Body Is A Cage ☆37 Updated 3 years ago