guptav96 / BDQN-PyTorch
Efficient Exploration through Bayesian Deep-Q Networks.
☆17Updated 2 years ago
Alternatives and similar repositories for BDQN-PyTorch:
Users that are interested in BDQN-PyTorch are comparing it to the libraries listed below
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- DecentralizedLearning☆24Updated 2 years ago
- ☆42Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆15Updated 4 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆19Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 3 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆26Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆19Updated 3 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆34Updated 3 weeks ago
- ☆12Updated 11 months ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆21Updated last year
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- Implementation of HindSight Experience Replay paper with Pytorch☆27Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆54Updated 3 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆36Updated 7 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆39Updated 4 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- qmix☆22Updated 4 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- ☆32Updated 6 months ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆36Updated 10 months ago
- Synthetic Experience Replay☆87Updated 9 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated this week
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆45Updated 2 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆30Updated 4 years ago