robintyh1 / onpolicybaselinesView external linksLinks
on-policy optimization baselines for deep reinforcement learning
☆32Apr 3, 2020Updated 5 years ago
Alternatives and similar repositories for onpolicybaselines
Users that are interested in onpolicybaselines are comparing it to the libraries listed below
Sorting:
- ICRL 2020☆20Feb 18, 2020Updated 5 years ago
- Regularization Matters in Policy Optimization☆21Nov 1, 2021Updated 4 years ago
- NOMU: Neural Optimization-based Model Uncertainty☆10Feb 17, 2023Updated 2 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Deep Structured Energy Based Model☆11Jan 6, 2018Updated 8 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- ☆16Mar 2, 2022Updated 3 years ago
- On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification☆21Apr 1, 2022Updated 3 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 6 months ago
- Bayesian and Maximum Likelihood Implementation of the Normalizing Flow Network (NFN): https://arxiv.org/abs/1907.08982☆22Dec 10, 2020Updated 5 years ago
- Contains legacy code and model examples for the paper "BayesFlow: Learning complex stochastic models with invertible neural networks"☆24Dec 20, 2020Updated 5 years ago
- [ICCV'19] DUAL-GLOW: Conditional Flow-Based Generative Model for Modality Transfer☆19Nov 25, 2022Updated 3 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Dec 16, 2018Updated 7 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- Stein Variational Policy Gradient for REINFORCE☆18Jul 12, 2017Updated 8 years ago
- A study of distance measures and learning methods for semi-supervised learning on time series data☆17Jun 23, 2021Updated 4 years ago
- Bayesian Optimization with Density-Ratio Estimation☆24Dec 26, 2022Updated 3 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆25Oct 26, 2021Updated 4 years ago
- This repo is for source code of NeurIPS 2021 paper "Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration".☆22Jan 4, 2022Updated 4 years ago
- PyTorch code of "Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows" (NeurIPS 2020)☆48Nov 5, 2020Updated 5 years ago
- Implementation of the Option-Critic Architecture☆40Dec 9, 2018Updated 7 years ago
- Official implementation for masked contrastive learning for anomaly detection.(IJCAI-21)☆18May 26, 2021Updated 4 years ago
- Random feature latent variable models in Python☆23Jul 23, 2023Updated 2 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago