amazon-science / fast-rl-with-slow-updates
β18Updated last year
Alternatives and similar repositories for fast-rl-with-slow-updates:
Users that are interested in fast-rl-with-slow-updates are comparing it to the libraries listed below
- Baselines for gymnax π€β66Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.β96Updated last year
- AGAC: Adversarially Guided Actor-Criticβ48Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according β¦β35Updated 11 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithmsβ46Updated 4 years ago
- Vectorization techniques for fast population-based training.β55Updated 2 years ago
- β28Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β54Updated 2 years ago
- Docker containers of baseline agents for the Crafter environmentβ28Updated 3 years ago
- Generalised UDRLβ37Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorchβ17Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)β9Updated last year
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement leaβ¦β36Updated last year
- A2C is a special case of PPO!β20Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorchβ35Updated last month
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.β21Updated 4 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β48Updated this week
- Scalable Opponent Shaping Experiments in JAXβ24Updated last year
- A web based platform for collecting human actions in reinforcement learning environmentsβ28Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 2 years ago
- Standard interface for entity based reinforcement learning environments.β37Updated last year
- The source code for the gym-microrts paper.β42Updated 2 years ago
- β43Updated 7 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ111Updated 8 months ago
- Various reinforcement learning algorithms written in Jax + Flaxβ24Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationβ68Updated 3 years ago
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementationβ20Updated 2 years ago
- Gym wrapper for pysc2β10Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"β44Updated last year