guptav96 / BDQN-PyTorch
Efficient Exploration through Bayesian Deep-Q Networks.
☆17Updated 3 years ago
Alternatives and similar repositories for BDQN-PyTorch:
Users that are interested in BDQN-PyTorch are comparing it to the libraries listed below
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆36Updated 2 months ago
- DecentralizedLearning☆24Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆13Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆29Updated last year
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆19Updated 2 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆15Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆12Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆21Updated 5 months ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆46Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)☆35Updated 5 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆40Updated 9 months ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆17Updated 3 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆41Updated 4 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆47Updated 9 months ago
- The official implementation of Memory-efficient DQN algorithm.☆10Updated last year
- A library for mean-field games.☆51Updated this week
- Collection of OpenAI parametrized action-space environments.☆64Updated last month
- ☆28Updated 10 months ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 4 years ago