zhougroup / IDAC

Implicit Distributional Actor Critic

☆11

Alternatives and similar repositories for IDAC

Users who are interested in IDAC are comparing it to the libraries listed below.

uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆30 Updated 2 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35 Updated 4 years ago
xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆28 Updated 2 years ago
dnishio / DSAC
The implementation of Discriminator Soft Actor Critic
☆15 Updated 5 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 Updated 5 years ago
zwfightzw / Meta-Critic
☆11 Updated 4 years ago
AnujMahajanOxf / VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14 Updated 5 years ago
TrentBrick / RewardConditionedUDRL
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆18 Updated 4 years ago
dongminlee94 / Reinforcement-Learning-Code
A repository for code of reinforcement learning algorithms with PyTorch
☆30 Updated 3 years ago
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R…
☆23 Updated 4 years ago
bonniesjli / DQN_SR
Count based exploration with the successor representation for Unity ML's Pyramid
☆12 Updated 6 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48 Updated 6 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39 Updated 8 months ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
snu-mllab / EMI
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆36 Updated 4 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 Updated 5 years ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆20 Updated 2 years ago
liziniu / HyperDQN
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12 Updated last year
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45 Updated 4 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40 Updated 2 years ago
dannysdeng / dqn-pytorch
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22 Updated 5 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 Updated 5 years ago
tuomaso / radial_rl
Code used in our paper "Robust Deep Reinforcement Learning through Adversarial Loss"
☆33 Updated last year
Cranial-XIX / metric-residual-network
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆17 Updated 2 years ago
mjanschek / pytorch_seed_rl
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14 Updated 4 years ago
philipjball / SAC_PyTorch
🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation
☆38 Updated 3 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17 Updated 3 years ago
thanhnguyentang / mmdrl
Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354
☆27 Updated 4 years ago
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆20 Updated 3 years ago
atavakol / action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23 Updated 4 years ago