amazon-science / fast-rl-with-slow-updates
☆18Updated last year
Alternatives and similar repositories for fast-rl-with-slow-updates
Users that are interested in fast-rl-with-slow-updates are comparing it to the libraries listed below
Sorting:
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆55Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆96Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 2 months ago
- AGAC: Adversarially Guided Actor-Critic☆49Updated 3 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆24Updated last year
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- ☆28Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 11 months ago
- ☆44Updated 7 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆9Updated 7 months ago
- flexible meta-learning in jax☆13Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆50Updated 2 weeks ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆28Updated last year
- A2C is a special case of PPO!☆21Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆78Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- Reinforcement learning in pure JAX.☆12Updated 2 months ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆36Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago