Hybrid Reward Architecture
☆80May 2, 2018Updated 7 years ago
Alternatives and similar repositories for hra
Users that are interested in hra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Optimizers in tensorflow from scratch☆18Jun 6, 2017Updated 8 years ago
- ☆17Jul 3, 2017Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code to implement SIMILE algorithm from the paper entitled "Smooth Imitation Learning for Online Sequence Prediction" from ICML 2016☆13May 25, 2016Updated 9 years ago
- RobustStabilityGuaranteeRL☆11Aug 22, 2019Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆16Dec 22, 2017Updated 8 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- reproduce some RL or Multi-Agent models☆35May 22, 2019Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆88Mar 5, 2018Updated 8 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated last month
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Nov 14, 2018Updated 7 years ago
- Meta Reinforcement Learning Experiments☆35Aug 22, 2017Updated 8 years ago
- Companion code for Closed-Loop Koopman Operator Approximation☆16Mar 24, 2024Updated 2 years ago
- code for icml paper: https://arxiv.org/abs/1711.03243v3☆12Jul 8, 2018Updated 7 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Jun 19, 2018Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆25Jan 15, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆24Oct 29, 2024Updated last year
- ☆13Dec 6, 2018Updated 7 years ago
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf☆25Dec 30, 2017Updated 8 years ago
- This repository includes the source code and dataset information needed to reproduce the results from our paper. For more information abo…☆28Feb 15, 2023Updated 3 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- NIPS 2017 Value Prediction Network☆167Jan 12, 2018Updated 8 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Jul 3, 2017Updated 8 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings☆96Jun 8, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation for Perspective Plane Program Induction from a Single Image (P3I).☆14Jun 25, 2020Updated 5 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- Material for git workshop☆11Mar 13, 2018Updated 8 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Code for the paper "Evolved Policy Gradients"☆254Nov 22, 2018Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- ☆62Jun 22, 2018Updated 7 years ago