baturaysaglam / LA3PLinks
Actor Prioritized Experience Replay
☆16Updated last year
Alternatives and similar repositories for LA3P
Users that are interested in LA3P are comparing it to the libraries listed below
Sorting:
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Meta RL codebase for Unstable Baselines☆21Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- ☆49Updated 4 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆140Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆111Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆31Updated 2 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆60Updated 2 years ago
- Distributional Soft Actor Critic☆58Updated 5 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆180Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 9 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆165Updated last year
- ☆43Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆89Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆209Updated 11 months ago
- Transformer in RL for decision-making☆100Updated 2 years ago
- ☆30Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆51Updated 3 months ago
- ☆57Updated 2 years ago
- ☆14Updated 4 years ago
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆48Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆130Updated 6 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆151Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆59Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆65Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆26Updated 2 years ago