gouxiangchen/soft-Q-learning

Discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL) implementation in PyTorch, simple!

☆58

Alternatives and similar repositories for soft-Q-learning

Users who are interested in soft-Q-learning are comparing it to the repositories listed below.

lcalem / reproduction-soft-qlearning-mutual-information
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10 · Jan 10, 2019 · Updated 7 years ago
seolhokim / InverseRL-Pytorch
Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation
☆68 · May 25, 2021 · Updated 4 years ago
deligentfool / GAIL_pytorch
The implement of GAIL with pytorch
☆14 · Mar 11, 2020 · Updated 5 years ago
benellis3 / mappo
☆18 · Aug 14, 2023 · Updated 2 years ago
bbrighttaer / guided-irl
Inverse Reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach
☆10 · Mar 23, 2020 · Updated 5 years ago
jscriptcoder / Upside-Down-Reinforcement-Learning
Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)
☆13 · Oct 25, 2023 · Updated 2 years ago
dbcbtc / RL-Papers
papers about reinforcement learning
☆13 · Jan 4, 2021 · Updated 5 years ago
lionelblonde / sam-tf
TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
☆10 · Dec 8, 2022 · Updated 3 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30 · May 25, 2021 · Updated 4 years ago
deligentfool / maddpg
Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch
☆10 · Aug 2, 2020 · Updated 5 years ago
williamshen-nz / STRIPS-HGN
Learning Domain-Independent Planning Heuristics over Hypergraphs (ICAPS'20)
☆14 · Mar 21, 2025 · Updated 11 months ago
bonniesjli / DQN_SR
Count based exploration with the successor representation for Unity ML's Pyramid
☆12 · Jun 19, 2019 · Updated 6 years ago
ruizhaogit / music
Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)
☆38 · Mar 1, 2021 · Updated 5 years ago
DuaneNielsen / rnd
Exploration by Random Network Distillation
☆15 · Dec 30, 2018 · Updated 7 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 · Nov 14, 2018 · Updated 7 years ago
dhananjaisharma10 / Model-based-Reinforcement-Learning
Model-based reinforcement learning using CEM, MPC and PETS
☆16 · Nov 20, 2019 · Updated 6 years ago
dnishio / DSAC
The implementation of Discriminator Soft Actor Critic
☆15 · Jan 25, 2020 · Updated 6 years ago
nric / ProximalPolicyOptimizationKeras
This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.
☆12 · Sep 3, 2020 · Updated 5 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆16 · Dec 23, 2018 · Updated 7 years ago
richardrl / rlkit-relational
Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"
☆104 · Dec 8, 2022 · Updated 3 years ago
Mee321 / policy-distillation
☆15 · Nov 22, 2019 · Updated 6 years ago
rudolfsteiner / DAgger
Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)
☆21 · Apr 15, 2018 · Updated 7 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 4 years ago
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆436 · Nov 28, 2023 · Updated 2 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20 · Oct 6, 2021 · Updated 4 years ago
salesforce / sibling-rivalry
Code for Sibling Rivalry and experiments presented in associated paper
☆17 · May 1, 2025 · Updated 10 months ago
fiberleif / POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
☆16 · Jun 5, 2019 · Updated 6 years ago
johnlime / RlkitExtension
Collection of reinforcement learning algorithms
☆16 · Sep 29, 2025 · Updated 5 months ago
FangchenLiu / SAIL
Code for Paper "State Alignment-based Imitation Learning". Under maintenance
☆17 · May 1, 2020 · Updated 5 years ago
ssokota / mmd
Code for magnetic mirror descent.
☆17 · Oct 5, 2023 · Updated 2 years ago
orybkin / video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
☆46 · Nov 22, 2022 · Updated 3 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆41 · Dec 9, 2018 · Updated 7 years ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆208 · Nov 26, 2024 · Updated last year
alex-petrenko / curious-rl
Curiosity-driven Exploration by Self-supervised Prediction
☆23 · Jun 13, 2019 · Updated 6 years ago
camall3n / markov-state-abstractions
Image-based gridworld experiment for learning Markov state abstractions
☆21 · Sep 16, 2024 · Updated last year
Manchery / iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
☆24 · Nov 4, 2024 · Updated last year
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆52 · Aug 27, 2022 · Updated 3 years ago
morning9393 / HAPPO-HATRPO
☆45 · Nov 29, 2021 · Updated 4 years ago
rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,405 · Nov 29, 2023 · Updated 2 years ago