abhishm/PGQ

PGQ is an approach that combines policy gradient methods and Q-learning. This repository will contain an implementation of PGQ.

☆15

Alternatives and similar repositories for PGQ

Users interested in PGQ compare it with the repositories listed below.

mrdrozdov / pytorch-machines
Stochastic Machines for Unsupervised Learning implemented in Pytorch.
☆10 · Sep 3, 2017 · Updated 8 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆25 · Jan 15, 2022 · Updated 4 years ago
tesatory / selfplay
Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
☆14 · May 1, 2018 · Updated 7 years ago
gkahn13 / CAPs
☆33 · Oct 17, 2018 · Updated 7 years ago
sungyubkim / amortized_svgd
A pytorch implementation of Amortized Stein Variational Gradient Descent / Stein GAN
☆18 · Dec 13, 2018 · Updated 7 years ago
rlbayes / rllabplusplus
☆160 · Jul 21, 2017 · Updated 8 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52 · Oct 26, 2020 · Updated 5 years ago
ofirnachum / models
Models built with TensorFlow
☆26 · Dec 5, 2018 · Updated 7 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102 · Jun 18, 2019 · Updated 6 years ago
deeplearninc / relaax
Reinforcement Learning framework to facilitate development and use of scalable RL algorithms and applications
☆61 · Mar 29, 2018 · Updated 7 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30 · Dec 11, 2020 · Updated 5 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33 · Dec 1, 2019 · Updated 6 years ago
tesslerc / H-DRLN
Hierarchical Deep RL Network
☆31 · Feb 20, 2017 · Updated 9 years ago
AmosLewis / Awesome-Robotics
Robotics Learning Note
☆11 · Jun 22, 2018 · Updated 7 years ago
itaicaspi / mgail
Model-Based Generative Adversarial Imitation Learning
☆89 · Mar 29, 2021 · Updated 4 years ago
alshedivat / lola
Code release for Learning with Opponent-Learning Awareness and variations.
☆151 · Apr 13, 2023 · Updated 2 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆38 · Feb 5, 2019 · Updated 7 years ago
cshenton / neuroevolution
Replication of Uber Neuroevolution paper
☆46 · Apr 14, 2018 · Updated 7 years ago
rstager / ARoboCar
Unreal Engine simulator for our self driving car training
☆11 · Nov 18, 2021 · Updated 4 years ago
ZhuFengdaaa / MAIN
☆13 · Feb 22, 2023 · Updated 3 years ago
zackchase / intrinsic-fear-dqn
Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10 · Nov 13, 2017 · Updated 8 years ago
ahundt / enas
TensorFlow code for paper "Training Frankenstein's Creature to Stack: HyperTree Architecture Search"
☆13 · Nov 14, 2018 · Updated 7 years ago
biomag-lab / hypocotyl-UNet
☆12 · Dec 2, 2020 · Updated 5 years ago
tianbingsz / SVRG
Stochastic Variance Reduction Policy Gradient Estimation
☆11 · Nov 6, 2018 · Updated 7 years ago
eringrant / spirl-readings
A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.
☆13 · May 5, 2021 · Updated 4 years ago
duckietown-udem / udem-fall19-public
Public accompanying repository for Universite de Montreal's IFT 6757: Autonomous Vehicles, Fall 2019.
☆12 · Jun 21, 2022 · Updated 3 years ago
yhyu13 / C51-DDPG
This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DDPG).
☆11 · Sep 14, 2017 · Updated 8 years ago
san99tiago / aws-cdk-ecs-api
AWS FastAPI deployment on top of ALB and ECS with Docker containers implementing ECS as the orchestration tool for an AWS-managed infrast…
☆10 · May 22, 2023 · Updated 2 years ago
dmalyuta / explicit_hybrid_mpc
Approximate Multiparametric Mixed-integer Convex Programming
☆15 · May 16, 2019 · Updated 6 years ago
icesit / sjtu_drone
ardrone simulation in gazebo (for kinetic and gazebo 7). Now it can run.
☆10 · Oct 27, 2017 · Updated 8 years ago
mubeipeng / focused_slam
☆11 · Sep 15, 2016 · Updated 9 years ago
mschulth / rhc
Implementation of Receding Horizon Curiosity Algorithm
☆13 · Mar 24, 2023 · Updated 2 years ago
obilaniu / Nauka
Nauka is a collection of utilities for scientific experiments.
☆15 · Jul 27, 2022 · Updated 3 years ago
ankurhanda / dexpilot
paper on dexpilot
☆15 · Oct 14, 2019 · Updated 6 years ago
Storyyeller / fnv-collider
FNV hash collision generator
☆12 · Mar 2, 2017 · Updated 9 years ago
lcalem / reproduction-soft-qlearning-mutual-information
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10 · Jan 10, 2019 · Updated 7 years ago
GokuMohandas / SELU
🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.
☆12 · Jun 19, 2017 · Updated 8 years ago
anirudh9119 / rl_adversarial
Learning Backtracking Models, ICLR'19
☆10 · Feb 2, 2018 · Updated 8 years ago