Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10Nov 13, 2017Updated 8 years ago
Alternatives and similar repositories for intrinsic-fear-dqn
Users that are interested in intrinsic-fear-dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN)☆10Mar 1, 2018Updated 8 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- 2019 Fall - Game theory and Multi-agent RL Termproject☆10Dec 13, 2019Updated 6 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Reinforcement Learning program that looks to be able to quickly learn to solve a Rubik's Cube☆15Jun 22, 2021Updated 4 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- ☆20Jul 16, 2019Updated 6 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis☆10Jul 10, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13May 30, 2019Updated 6 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 10 months ago
- scala multi-dimensional arrays with reverse-mode autodifferentiation☆18Nov 10, 2017Updated 8 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- A simple Gridworld environment for Open AI gym☆25Jun 10, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- allennlp + streamlit demo☆21Oct 2, 2019Updated 6 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- A general implementation of a FFT, FIR and IIR filters and some other General Functions in a TMS320C5535 ezdsp including FFT and FIR, IIR…☆21Apr 20, 2016Updated 9 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 6 years ago
- Soccer toy example simulator used in Reinforcement Learning☆12Mar 11, 2018Updated 8 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A C# wrapper for the WORLD vocoder☆24Jun 21, 2021Updated 4 years ago
- Summaries and minimal implementations of ML / statistics research articles.☆39Feb 23, 2021Updated 5 years ago
- The interface between probabilistic model checking and data-driven policy learning.☆17Mar 21, 2026Updated last week
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Dec 28, 2017Updated 8 years ago
- A web page to collect reproduced papers in one place with their codes☆14Mar 8, 2023Updated 3 years ago
- snake-gym is implementation of the classic game snake that is made as an OpenAI gym environment☆24Jul 25, 2024Updated last year