natetsang / open-rl
Implementations of a large collection of reinforcement learning algorithms.
☆26Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for open-rl
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆19Updated last year
- DecentralizedLearning☆21Updated last year
- OpenAi's gym environment wrapper to vectorize them with Ray☆22Updated last year
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆17Updated last year
- Constrained Exploration and Recovery from Experience Shaping☆21Updated 5 years ago
- Distributed Deep Reinforcement Learning☆29Updated 3 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆45Updated 8 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆26Updated 5 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 7 months ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆20Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆18Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Toolkit of Causal Model-based Reinforcement Learning.☆32Updated last year
- Experiments to train transformer network to master reinforcement learning environments.☆33Updated 3 years ago
- ☆71Updated 5 months ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13Updated 6 months ago
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆58Updated last year
- a modular reinforcement learning library with JAX agents☆22Updated last year
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago