ML-Collective / rl-playgroundLinks
This repo is for members of the rl-implementation channel on MLC Discord to play with RL algorithms and learn.
☆10Updated 3 years ago
Alternatives and similar repositories for rl-playground
Users that are interested in rl-playground are comparing it to the libraries listed below
Sorting:
- Reinforcement learning library in JAX.☆100Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆33Updated last year
- ☆36Updated 2 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆15Updated last year
- An environment for benchmarking commonsense agents☆29Updated 4 years ago
- ☆53Updated 7 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- ☆28Updated 2 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Documentation for dynamic machine learning systems.☆29Updated 8 months ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆10Updated 4 years ago
- ☆31Updated 2 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- ☆56Updated 2 years ago
- Scaling scaling laws with board games.☆49Updated last year
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆14Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated 9 months ago
- Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.☆21Updated 3 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019☆11Updated 5 years ago
- General Modules for JAX☆66Updated 2 months ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- PAIRED in PyTorch 🔥☆60Updated 2 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 3 years ago
- ☆26Updated 2 years ago