AmiiThinks / AlphaEx
A Python Toolkit for Managing a Large Number of Experiments
☆31 · Updated last year
Alternatives and similar repositories for AlphaEx:
Users who are interested in AlphaEx are comparing it to the libraries listed below.
- PAIRED in PyTorch 🔥 ☆58 · Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆85 · Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆53 · Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 ☆25 · Updated 4 years ago
- An implementation of MuZero in JAX. ☆55 · Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms ☆140 · Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 · Updated last year
- krazy grid world ☆25 · Updated 5 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆93 · Updated 6 years ago
- Code for 'The Grand Atari Challenge dataset' paper ☆52 · Updated 7 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 · Updated 3 years ago
- General Modules for JAX ☆64 · Updated this week
- Nethack Learning Environment Wrapper for Language Interface ☆36 · Updated last year
- Scaling scaling laws with board games. ☆48 · Updated last year
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL ☆86 · Updated 6 years ago
- A3C style Option-Critic with deliberation cost ☆39 · Updated 7 years ago
- Vectorization techniques for fast population-based training. ☆55 · Updated 2 years ago
- MultiTask Environments for Reinforcement Learning. ☆74 · Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆60 · Updated 5 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆67 · Updated 5 years ago
- JAX implementations of core Deep RL algorithms ☆79 · Updated 2 years ago
- Invariant Causal Prediction for Block MDPs ☆44 · Updated 4 years ago
- Reinforcement Learning with Latent Flow ☆43 · Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 · Updated 6 months ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆69 · Updated last year
- Baselines for gymnax 🤖 ☆65 · Updated last year