anyscale / rl-courseLinks
☆25Updated 2 years ago
Alternatives and similar repositories for rl-course
Users that are interested in rl-course are comparing it to the libraries listed below
Sorting:
- Understanding RL vision Distill article☆24Updated 2 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆83Updated last year
- Reinforcement learning library in JAX.☆100Updated last year
- Performant, differentiable reinforcement learning☆123Updated last month
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- An introductory tutorial about leveraging Ray core features for distributed patterns.☆78Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 2 weeks ago
- A C++ pytorch implementation of MuZero☆40Updated last year
- Accelerated replay buffers in JAX☆43Updated 3 years ago
- Gym wrapper for pysc2☆10Updated 3 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆72Updated 2 years ago
- ☆13Updated 3 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆26Updated 10 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Baselines for gymnax 🤖☆71Updated 2 years ago
- ☆88Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- A distributed GPU-centric experience replay system for large AI models.☆18Updated 2 years ago
- ☆13Updated last year
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆64Updated last year
- OpenAi's gym environment wrapper to vectorize them with Ray☆23Updated 2 years ago
- ☆29Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆22Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago