zhihanyang2022 / drqnLinks

Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert

☆8

Alternatives and similar repositories for drqn

Users that are interested in drqn are comparing it to the libraries listed below

Sorting:

zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆85Updated last year
yashbonde / Transformer-RL
Experiments to train transformer network to master reinforcement learning environments.
☆32Updated 4 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆15Updated 6 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24Updated 2 years ago
rpatrik96 / AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
☆27Updated 5 years ago
yeshenpy / RACE
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…
☆34Updated last year
baimingc / delay-aware-MBRL
Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆26Updated 5 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆39Updated 2 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39Updated 8 months ago
dhananjaisharma10 / Model-based-Reinforcement-Learning
Model-based reinforcement learning using CEM, MPC and PETS
☆16Updated 5 years ago
yuchen-x / MacroMARL
☆21Updated last year
alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆28Updated last month
tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆44Updated 6 years ago
pairlab / d2rl
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆39Updated 4 years ago
xupei0610 / KDMA
[IROS2021] Human-Inspired Multi-Agent Navigation using Knowledge Distillation
☆24Updated last year
dnandha / mopac
Model Predictive Actor-Critic Reinforcement Learning
☆63Updated 3 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
gjp1203 / nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆33Updated 6 years ago
udion / Transformer-RL
Experiments with transformer based RL algorithms
☆22Updated 5 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
IanRDavies / LeMOL
Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…
☆14Updated 3 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
HannesStark / gnn-reinforcement-learning
Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.
☆33Updated 4 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated last week
akifumi-wachi-4 / safe_near_optimal_mdp
Safe Reinforcement Learning in Constrained Markov Decision Processes
☆59Updated 4 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆17Updated 3 years ago
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆23Updated 4 years ago
suneelbelkhale / model-based-meta-rl-for-flight
Codebase for Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads paper. Website: https://sites.google.com/view/met…
☆31Updated 2 years ago
caslab-vt / SARNet
Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)
☆26Updated 3 years ago