robintyh1/onpolicybaselines

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/robintyh1/onpolicybaselines)

robintyh1 / onpolicybaselines

on-policy optimization baselines for deep reinforcement learning

☆32

Alternatives and similar repositories for onpolicybaselines

Users that are interested in onpolicybaselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

koulanurag / dream-and-search
View on GitHub
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12Jul 12, 2021Updated 5 years ago
tuomaso / radial_rl
View on GitHub
Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"
☆33Oct 3, 2023Updated 2 years ago
AutumnWu / Streamlined-Off-Policy-Learning
View on GitHub
ICRL 2020
☆20Feb 18, 2020Updated 6 years ago
BY571 / Upside-Down-Reinforcement-Learning
View on GitHub
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆79Aug 13, 2020Updated 5 years ago
xuanlinli17 / iclr2021_rlreg
View on GitHub
Regularization Matters in Policy Optimization
☆21Nov 1, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
BY571 / D4PG
View on GitHub
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆24Apr 7, 2021Updated 5 years ago
lns / dapo
View on GitHub
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Nov 28, 2019Updated 6 years ago
desi-ivanova / idad
View on GitHub
Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods
☆25Dec 30, 2021Updated 4 years ago
joeybose / FloRL
View on GitHub
Implicit Normalizing Flows + Reinforcement Learning
☆62May 31, 2019Updated 7 years ago
tedmoskovitz / WNPG
View on GitHub
implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies
☆13Mar 9, 2021Updated 5 years ago
Shen-Lab / Bayesian-L2O
View on GitHub
[ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…
☆14Aug 19, 2022Updated 3 years ago
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
dominikalfke / TimeSeriesSSL
View on GitHub
A study of distance measures and learning methods for semi-supervised learning on time series data
☆17Jun 23, 2021Updated 5 years ago
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
microsoft / HuRL
View on GitHub
Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper
☆17Jan 3, 2022Updated 4 years ago
atavakol / action-branching-agents
View on GitHub
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆122Feb 3, 2023Updated 3 years ago
Silvicek / distributional-dqn
View on GitHub
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132May 5, 2019Updated 7 years ago
google-deepmind / affordances_option_models
View on GitHub
☆22Nov 8, 2021Updated 4 years ago
zhangmazi1 / Data-preprocessing
View on GitHub
数据预处理——插值法填补缺失值，并且标记填充位置
☆10Apr 19, 2019Updated 7 years ago
tgangwani / BMIL
View on GitHub
Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
☆22Aug 4, 2022Updated 3 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
quantumiracle / Consistency_Model_For_Reinforcement_Learning
View on GitHub
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆27Aug 28, 2024Updated last year
microsoft / strategically_efficient_rl
View on GitHub
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21Jul 30, 2024Updated last year
ling-pan / RES
View on GitHub
☆25Feb 21, 2022Updated 4 years ago
arushijain94 / SafeOptionCritic
View on GitHub
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆21Dec 16, 2018Updated 7 years ago
wyjung0625 / p3s
View on GitHub
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Jan 9, 2020Updated 6 years ago
nuria95 / O-RAAC
View on GitHub
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
☆36Feb 9, 2021Updated 5 years ago
manantomar / Mirror-Descent-Policy-Optimization
View on GitHub
Mirror Descent Policy Optimization
☆43Oct 31, 2020Updated 5 years ago
RajGhugare19 / VE-principle-for-model-based-RL
View on GitHub
Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18Apr 13, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
XuehaiPan / Soft-Actor-Critic
View on GitHub
PyTorch Implementation of Soft Actor-Critic Algorithm
☆12Sep 13, 2020Updated 5 years ago
taodav / nsrs
View on GitHub
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Jul 16, 2024Updated 2 years ago
jachiam / surprise
View on GitHub
Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
uoe-agents / seps
View on GitHub
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆25Oct 26, 2021Updated 4 years ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
BorealisAI / continuous-time-flow-process
View on GitHub
PyTorch code of "Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows" (NeurIPS 2020)
☆49Nov 5, 2020Updated 5 years ago
robotgradient / iflow
View on GitHub
☆25Oct 28, 2020Updated 5 years ago