amazon-science / fast-rl-with-slow-updatesLinks
β18Updated last year
Alternatives and similar repositories for fast-rl-with-slow-updates
Users that are interested in fast-rl-with-slow-updates are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.β97Updated 2 years ago
- Baselines for gymnax π€β68Updated 2 years ago
- AGAC: Adversarially Guided Actor-Criticβ48Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorchβ37Updated 4 months ago
- β44Updated 10 months ago
- Tabular methods for reinforcement learningβ38Updated 5 years ago
- Various reinforcement learning algorithms written in Jax + Flaxβ27Updated 2 years ago
- Standard interface for entity based reinforcement learning environments.β38Updated last year
- β28Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ114Updated 11 months ago
- A web based platform for collecting human actions in reinforcement learning environmentsβ30Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β58Updated 3 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learningβ73Updated 11 months ago
- Challenging Memory-based Deep Reinforcement Learning Agentsβ102Updated 9 months ago
- β102Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".β87Updated last year
- Explainable Reinforcement Learning (XRL) Resourcesβ41Updated 10 months ago
- The source code for the gym-microrts paper.β42Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQNβ45Updated 4 years ago
- JAX library for MARL researchβ88Updated last year
- Reinforcement learning training framework for entity-gym environments.β17Updated last year
- Vectorization techniques for fast population-based training.β56Updated 2 years ago
- A tool for recording RL trajectories.β106Updated last week
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environmβ¦β41Updated 2 years ago
- β54Updated 9 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the β¦β88Updated 4 years ago
- β52Updated 2 years ago
- Reinforcement learning library in JAX.β100Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to eβ¦β83Updated last year
- A2C is a special case of PPO!β22Updated 3 years ago