andrecianflone / dynaq

Exploring the Dyna-Q reinforcement learning algorithm

☆16

Related projects ⓘ

Alternatives and complementary repositories for dynaq

IouJenLiu / CMAE
☆44Updated 3 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆38Updated 4 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆96Updated 2 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆114Updated 2 weeks ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆50Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 3 years ago
david-simoes-93 / Mixed-Policy-Asynchronous-Deep-Q-Learning
Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…
☆15Updated 7 years ago
uoe-agents / lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
☆40Updated 2 months ago
dkkim93 / meta-mapg
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆32Updated 2 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆82Updated 5 years ago
oxwhirl / comix
☆42Updated 3 years ago
wendelinboehmer / dcg
☆71Updated 5 months ago
chaovven / SMIX
Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020
☆26Updated last year
AlgTUDelft / AlwaysSafe
Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"
☆18Updated 2 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆30Updated last year
matteokarldonati / Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
☆52Updated 4 years ago
kikojay / EMC
The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.
☆39Updated last year
saizhang0218 / TMC
Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
☆26Updated 3 years ago
gjp1203 / nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆32Updated 5 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆95Updated 3 years ago
akifumi-wachi-4 / safe_near_optimal_mdp
Safe Reinforcement Learning in Constrained Markov Decision Processes
☆55Updated 4 years ago
saizhang0218 / VBC
pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"
☆51Updated last year
HaiyinPiao / pytorch-a2clstm-DRQN
using recurrent networks(LSTM) to solve POMDPs
☆35Updated 6 years ago
sisl / DICG
Deep Implicit Coordination Graphs
☆41Updated 5 months ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆36Updated 5 years ago
dadadidodi / m3ddpg
☆47Updated 5 years ago
QDPP-GitHub / QDPP
Multi-Agent Determinantal Q-Learning
☆42Updated 2 years ago
anyboby / Constrained-Model-Based-Policy-Optimization
Code for a model-based version of Constrained Policy Optimization
☆10Updated 3 years ago
root-master / unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework
☆37Updated 5 years ago
ovechkin-dm / ppo-lstm-parallel
ppo-lstm-parallel
☆42Updated 5 years ago