louaaron/GAN-Q-Learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/louaaron/GAN-Q-Learning)

louaaron / GAN-Q-Learning

Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874

☆47

Alternatives and similar repositories for GAN-Q-Learning

Users that are interested in GAN-Q-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
cocoissong / TableGAN
View on GitHub
☆13Sep 11, 2018Updated 7 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
StanfordVL / ac-teach
View on GitHub
Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
☆24Feb 15, 2023Updated 3 years ago
kshmelkov / gan_evaluation
View on GitHub
Code release for paper "How good is my GAN?"
☆12Mar 9, 2019Updated 7 years ago
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 7 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
dwjshift / IL_ADS
View on GitHub
code for the paper Imitation Learning from Observation with Automatic Discount Scheduling
☆13Mar 27, 2024Updated 2 years ago
Stanford-ILIAD / ILEED
View on GitHub
Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"
☆11Jul 5, 2023Updated 3 years ago
QDPP-GitHub / QDPP
View on GitHub
Multi-Agent Determinantal Q-Learning
☆43Nov 22, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
lmb-freiburg / td-or-not-td
View on GitHub
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12Aug 24, 2018Updated 7 years ago
farismismar / Q-Learning-Power-Control
View on GitHub
Code for the following publication: F. B. Mismar, J. Choi, and B. L. Evans, "A Framework for Automated Cellular Network Tuning with Rein…
☆51Jan 24, 2022Updated 4 years ago
etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
google-research / policy-learning-landscape
View on GitHub
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Jan 16, 2019Updated 7 years ago
Pearl-UTexas / ActiveVaR
View on GitHub
Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"
☆17Dec 20, 2018Updated 7 years ago
nhynes / abc
View on GitHub
SeqGAN but with more bells and whistles
☆24Feb 15, 2018Updated 8 years ago
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 7 years ago
alshedivat / lola
View on GitHub
Code release for Learning with Opponent-Learning Awareness and variations.
☆152Apr 13, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
jesbu1 / carl
View on GitHub
Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
☆14Nov 22, 2022Updated 3 years ago
feidieufo / homework
View on GitHub
Assignments for CS294-112.
☆30Sep 11, 2019Updated 6 years ago
yangmuzhi / airl
View on GitHub
learning robust rewards with adversarial inverse reinforcement learning
☆14Sep 13, 2020Updated 5 years ago
ajgupta93 / d4pg-pytorch
View on GitHub
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19Jun 15, 2018Updated 8 years ago
AndreaTirinzoni / iw-transfer-rl
View on GitHub
Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).
☆16May 29, 2018Updated 8 years ago
Yu-Maryland / RESPECT
View on GitHub
RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23)
☆11Apr 13, 2023Updated 3 years ago
facebookresearch / gwil
View on GitHub
Cross-Domain Imitation Learning via Optimal Transport
☆27Jun 24, 2022Updated 4 years ago
davidsandberg / rl_ssms
View on GitHub
State Space Models for Reinforcement Learning in Tensorflow
☆19Jan 27, 2019Updated 7 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zouchangjie / RL-Nash-Q-learning
View on GitHub
强化学习中纳什Qlearning 实现矩阵博弈
☆31Feb 25, 2019Updated 7 years ago
uncharted-technologies / risk-and-uncertainty
View on GitHub
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago
sugiyama404 / ReinfoceLearningForTrading
View on GitHub
☆13Mar 31, 2024Updated 2 years ago
liyiying / meta-MADDPG
View on GitHub
meta-MADDPG (Python implementation)
☆19Sep 16, 2018Updated 7 years ago
ermongroup / MA-AIRL
View on GitHub
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
☆220Jun 19, 2019Updated 7 years ago
xinleipan / gym-gridworld
View on GitHub
Simple grid-world environment compatible with OpenAI-gym
☆50Mar 19, 2020Updated 6 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago