on-policy optimization baselines for deep reinforcement learning
☆32Apr 3, 2020Updated 5 years ago
Alternatives and similar repositories for onpolicybaselines
Users that are interested in onpolicybaselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- ICRL 2020☆20Feb 18, 2020Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆78Aug 13, 2020Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Regularization Matters in Policy Optimization☆21Nov 1, 2021Updated 4 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆22Dec 30, 2021Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.☆16Aug 26, 2023Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Deep Structured Energy Based Model☆11Jan 6, 2018Updated 8 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- 数据预处理——插值法填补缺失值,并且标记填充位置☆10Apr 19, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics☆17Feb 10, 2022Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆48Sep 20, 2023Updated 2 years ago
- NOMU: Neural Optimization-based Model Uncertainty☆10Feb 17, 2023Updated 3 years ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- ☆25Feb 21, 2022Updated 4 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Dec 16, 2018Updated 7 years ago
- Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).☆19Jun 8, 2021Updated 4 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Symbolic Modelling of Highway Traffic Networks with METANET☆11May 27, 2025Updated 9 months ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year