danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28Updated 3 years ago
Alternatives and similar repositories for crafter-baselines:
Users that are interested in crafter-baselines are comparing it to the libraries listed below
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆45Updated 4 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Generalised UDRL☆37Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆17Updated 11 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆21Updated 2 years ago
- ☆42Updated 4 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆23Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆83Updated 2 years ago
- My Body Is A Cage☆40Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆54Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- ☆44Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning.☆25Updated 2 years ago
- Change-Based Exploration Transfer☆36Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 6 months ago
- ☆23Updated 2 years ago
- ☆15Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 2 years ago
- ☆29Updated 4 years ago