theOGognf / rl8Links
A high throughput, end-to-end RL library for infinite-horizon tasks.
☆21Updated last month
Alternatives and similar repositories for rl8
Users that are interested in rl8 are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆74Updated 2 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆26Updated 2 years ago
- Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.☆162Updated 5 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 3 years ago
- 🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper☆38Updated 5 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆34Updated 5 months ago
- fast + parallel AlphaZero in JAX☆106Updated 11 months ago
- A Multi-agent reinforcement-learning simulator framework.☆81Updated last year
- A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum …☆150Updated 3 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆40Updated 8 months ago
- A flexible and extensible reinforcement learning library for Python, designed for both beginners and researchers.☆18Updated 11 months ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29Updated 6 months ago
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆289Updated 7 months ago
- JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading☆36Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆112Updated last month
- Reading list for adversarial perspective and robustness in deep reinforcement learning.☆126Updated 4 months ago
- A C++ pytorch implementation of MuZero☆41Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆40Updated 3 weeks ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Updated 8 months ago
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t…☆13Updated 2 years ago
- Reinforcement learning training framework for entity-gym environments.☆17Updated last year
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Updated 4 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆120Updated last year
- Ray RLlib tutorial material☆121Updated 3 years ago
- Produce intelligence by means of natural selection without objective/reward optimization☆15Updated 4 years ago
- Mini RL Lab☆17Updated last year
- Multi-objective Gymnasium environments for reinforcement learning☆353Updated 4 months ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆82Updated 11 months ago
- An easy-to-use reinforcement learning library for research and education.☆174Updated 2 weeks ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆60Updated 9 months ago