LiuShuai26 / Distributed-RL
Distributed DRL by Ray and TensorFlow Tutorial.
☆10Updated 5 years ago
Alternatives and similar repositories for Distributed-RL
Users that are interested in Distributed-RL are comparing it to the libraries listed below
- A distributed GPU-centric experience replay system for large AI models.☆18Updated 2 years ago
- ☆18Updated 6 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Updated 4 years ago
- A Really Scalable RL Framework to 10k+ CPUs☆34Updated last year
- Distributed ML Optimizer☆32Updated 4 years ago
- FEN Code☆38Updated 5 years ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Updated 5 years ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆48Updated last year
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆62Updated 4 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆76Updated 5 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- ☆30Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆21Updated 8 months ago
- Assignments for CS294-112 Fall2018 in Pytorch☆65Updated 6 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆30Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 3 years ago
- ☆46Updated last month
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 6 months ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- StarCraft 2 Imitation Learning☆29Updated 4 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Updated 4 years ago
- Code for the paper "Batch size invariance for policy optimization"☆52Updated 2 years ago