SamsungLabs / tqc_pytorch

Implementation of the Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/

☆97

Alternatives and similar repositories for tqc_pytorch

Users who are interested in tqc_pytorch are comparing it to the libraries listed below.

evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆100 Updated 3 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133 Updated 10 months ago
yusukeurakami / dreamer-pytorch
PyTorch implementation of Dreamer (model-based image RL algorithm)
☆166 Updated 4 months ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆68 Updated last year
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆100 Updated 3 years ago
watchernyu / REDQ
Author's PyTorch implementation of the Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆168 Updated 6 months ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96 Updated 4 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆143 Updated last year
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆85 Updated last year
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆168 Updated 3 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆115 Updated 3 years ago
xtma / dsac
Distributional Soft Actor Critic
☆53 Updated 4 years ago
Farama-Foundation / D4RL-Evaluations
☆196 Updated 2 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆71 Updated 11 months ago
quanvuong / handful-of-trials-pytorch
Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆189 Updated 2 years ago
kngwyu / mujoco-maze
Simple maze environments using mujoco-py
☆56 Updated last year
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆216 Updated last year
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆179 Updated 3 years ago
jsikyoon / dreamer-torch
PyTorch version of Dreamer, which follows the original TF v2 code.
☆125 Updated 3 years ago
cross32768 / PlaNet_PyTorch
Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch
☆47 Updated 5 years ago
danijar / director
Deep Hierarchical Planning from Pixels
☆102 Updated 2 years ago
syuntoku14 / pytorch-rl-il
A library for building reinforcement learning and imitation learning agents in PyTorch
☆59 Updated 4 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)
☆239 Updated 5 years ago
neka-nat / distributed_rl
PyTorch implementation of distributed deep reinforcement learning
☆76 Updated 2 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆138 Updated 2 years ago
young-geng / CQL
Conservative Q Learning on top of SAC
☆130 Updated 2 years ago
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆80 Updated 2 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆49 Updated 3 weeks ago
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆74 Updated 11 months ago
facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆147 Updated 3 years ago