jjccero / pbrl
A Population Based Reinforcement Learning Library based on PyTorch
☆26 Updated 2 years ago
Alternatives and similar repositories for pbrl
Users that are interested in pbrl are comparing it to the libraries listed below
- Meta RL codebase for Unstable Baselines ☆22 Updated 2 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy ☆20 Updated 3 years ago
- Re-implementations of SOTA RL algorithms. ☆135 Updated 2 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions ☆31 Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games ☆32 Updated 2 years ago
- Paper Collection for Batch RL with brief introductions. ☆84 Updated 3 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆47 Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO). ☆92 Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆60 Updated 2 years ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward ☆19 Updated 2 years ago
- ☆24 Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆59 Updated last year
- ☆115 Updated 2 years ago
- [ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara) ☆13 Updated last year
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022 ☆62 Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 Updated 2 years ago
- Model-based Hindsight Experience Replay ☆10 Updated 3 years ago
- V-MPO torch version with DMLab30 and GTrXL ☆13 Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala) ☆27 Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979… ☆13 Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R… ☆24 Updated 4 years ago
- An unofficial implementation for online decision transformer ☆40 Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆151 Updated 2 years ago
- ☆26 Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆94 Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments. ☆32 Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆74 Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆52 Updated 5 months ago
- Experiments with reinforcement learning and recurrent neural networks ☆115 Updated 2 years ago