baturaysaglam / actor-prioritized-exp-replayLinks
Actor Prioritized Experience Replay
☆17Updated 2 years ago
Alternatives and similar repositories for actor-prioritized-exp-replay
Users that are interested in actor-prioritized-exp-replay are comparing it to the libraries listed below
Sorting:
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Meta RL codebase for Unstable Baselines☆22Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆170Updated last year
- Official implementation of Neural Episodic Control with State Abstraction☆13Updated 2 years ago
- ☆49Updated 4 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆146Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- ☆43Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- ☆15Updated 4 years ago
- Authors' implementation of PEER☆11Updated 2 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆27Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆62Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- ☆30Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆26Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆158Updated last year
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Updated 3 years ago
- Transformer in RL for decision-making☆103Updated 2 years ago
- ☆30Updated 4 years ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆41Updated last year
- Collection of OpenAI parametrized action-space environments.☆67Updated 9 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 3 years ago
- Prioritized Sequence Experience Replay☆10Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- ☆20Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago