yaoliucs/PQL

Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"

☆11

Alternatives and similar repositories for PQL

Users who are interested in PQL are comparing it to the libraries listed below.

Wenxuan-Zhou / PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆54 · Oct 18, 2021 · Updated 4 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26 · May 5, 2020 · Updated 6 years ago
usnistgov / CEGO
C++11 Evolutionary Global Optimization
☆13 · Dec 12, 2024 · Updated last year
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14 · May 23, 2021 · Updated 5 years ago
qingqu06 / MCS-BD
Code for multichannel sparse blind deconvolution problem.
☆12 · Jan 9, 2020 · Updated 6 years ago
djin31 / VesselExtract
U-net based CNN for segmenting blood vessel and thereafter removal of vessels from fundus image
☆12 · Jul 13, 2020 · Updated 6 years ago
leoliu1221 / matlabFaceRecognitionRealTime
https://www.youtube.com/watch?v=hYwsnXm0uiw&list=UUFMOamu3rLNNT1tqx462FQw
☆10 · Nov 10, 2014 · Updated 11 years ago
tdhock / interactive-tutorial
useR 2016 tutorial on "Understanding and creating interactive graphics"
☆13 · Jan 6, 2025 · Updated last year
yilundu / task_agnostic_dynamics_prior
Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning
☆12 · Jun 13, 2019 · Updated 7 years ago
KAIST-AILab / gmmil
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11 · Oct 2, 2018 · Updated 7 years ago
google-research / dice_rl
☆114 · Jul 3, 2026 · Updated 2 weeks ago
mannau / tm.plugin.sentiment
Retrieve structured, textual data from various web sources.
☆19 · Sep 6, 2015 · Updated 10 years ago
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 · Nov 24, 2021 · Updated 4 years ago
andymiller / vboost
code supplement for variational boosting (https://arxiv.org/abs/1611.06585)
☆11 · Jul 24, 2017 · Updated 8 years ago
toshikwa / discor.pytorch
PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.
☆37 · Jun 22, 2022 · Updated 4 years ago
lionelblonde / sam-pytorch
PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
☆10 · Nov 22, 2019 · Updated 6 years ago
gioramponi / sigma-girl-MIIRL
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13 · May 22, 2023 · Updated 3 years ago
matsuolab / BREMEN
Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)
☆54 · Jul 7, 2021 · Updated 5 years ago
pnnl / GridSTAGE
☆28 · Aug 21, 2022 · Updated 3 years ago
microsoft / ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74 · Feb 2, 2023 · Updated 3 years ago
ramp-kits / rl_simulator
Model-based reinforcement learning (generative simulator models and planning agents)
☆16 · Mar 13, 2026 · Updated 4 months ago
CausalML / DoubleReinforcementLearningMDP
☆14 · May 15, 2025 · Updated last year
TLKline / poreture
Centerline Extraction, Skeletonization Methods, and Interbranch/Pore Segment Measurements.
☆20 · Nov 2, 2014 · Updated 11 years ago
asonabend / ESRL
Code for Expert Supervised Reinforcement Learning
☆10 · Apr 7, 2021 · Updated 5 years ago
banma12956 / HIPI-RL
☆10 · Jun 22, 2020 · Updated 6 years ago
suyoung-lee / Episodic-Backward-Update
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16 · Sep 24, 2019 · Updated 6 years ago
younggyoseo / pytorch-acer
PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)
☆16 · Oct 7, 2020 · Updated 5 years ago
sjtuytc / AAAI21-RoutineAugmentedPolicyLearning
Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration
☆17 · Jan 7, 2021 · Updated 5 years ago
NrLabFreiburg / inverse-q-learning
☆15 · Oct 16, 2020 · Updated 5 years ago
syuntoku14 / pytorch-rl-il
A library for building reinforcement learning and imitation learning agents in Pytorch
☆61 · Jun 13, 2020 · Updated 6 years ago
xuwkk / Robust_MTD
This repo contains code and visualisation for "Robust moving target defence against false data injection attacks in power grids"
☆26 · Dec 8, 2022 · Updated 3 years ago
matpalm / cartpoleplusplus
3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF
☆58 · Jan 2, 2017 · Updated 9 years ago
uscresl / humanoid-gail
Humanoid behavior imitation using Generative Adversarial Imitation Learning (GAIL)
☆16 · Jul 1, 2020 · Updated 6 years ago
saiboxx / offline-reinforcement-learning
Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)
☆17 · Jul 7, 2020 · Updated 6 years ago
huminan / PowerGrid
simulate centralized and distributed power grid, in order to implement state estimation and bad data detection. Also, simplest FDI Attack…
☆23 · May 9, 2020 · Updated 6 years ago
LIBBLE / LIBBLE-PS
LIBBLE by Parameter Server
☆17 · Sep 17, 2018 · Updated 7 years ago
Facebear-ljx / SBAC
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
☆11 · Jul 4, 2022 · Updated 4 years ago
mlcircus / graphite_instructions
Instructions for using the graphite cluster
☆22 · Apr 30, 2019 · Updated 7 years ago
deligentfool / GAIL_pytorch
The implement of GAIL with pytorch
☆14 · Mar 11, 2020 · Updated 6 years ago