jcoreyes / evolvingrl
Supplementary Data for Evolving Reinforcement Learning Algorithms
☆46Updated 4 years ago
Alternatives and similar repositories for evolvingrl:
Users that are interested in evolvingrl are comparing it to the libraries listed below
- Generalised UDRL☆37Updated 2 years ago
- ☆28Updated 2 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆35Updated last month
- ☆53Updated 6 months ago
- ☆51Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆54Updated 2 years ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆22Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆78Updated last year
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆32Updated 9 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- ☆18Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Accompanying code for "Learning and Planning in Average-Reward Markov Decision Processes"☆14Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Library to compare and evaluate reward functions☆66Updated last year
- ☆31Updated 2 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning.☆25Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated 2 years ago