mrahtz / learning-from-human-preferencesView external linksLinks
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆333Nov 29, 2021Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below
Sorting:
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Learning From Human Preferences - Tensorflow+Keras Implementation☆18Aug 17, 2017Updated 8 years ago
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback☆562Jan 24, 2023Updated 3 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Sep 1, 2022Updated 3 years ago
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆28Jan 22, 2019Updated 7 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Dec 13, 2019Updated 6 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- ☆53Nov 10, 2022Updated 3 years ago
- Easy TensorFlow logging for quick prototypes☆110Oct 20, 2021Updated 4 years ago
- Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - 2009)☆21May 6, 2020Updated 5 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆133Nov 3, 2021Updated 4 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆35Oct 22, 2020Updated 5 years ago
- StarCraft: BroodWars OpenAI Gym environment☆84Jan 8, 2019Updated 7 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Machine Learning Course Project Skoltech 2018☆108Feb 11, 2019Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,123Oct 13, 2017Updated 8 years ago
- ☆10Mar 22, 2021Updated 4 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆265Apr 14, 2020Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 4 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 5 years ago
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆67Jun 21, 2018Updated 7 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆34Oct 28, 2020Updated 5 years ago
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!☆104Dec 8, 2022Updated 3 years ago
- ☆37Apr 27, 2023Updated 2 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆209May 20, 2021Updated 4 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Oct 25, 2021Updated 4 years ago
- Library to compare and evaluate reward functions☆67Oct 23, 2023Updated 2 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆772Dec 22, 2023Updated 2 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,040Jun 10, 2023Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆435Nov 28, 2023Updated 2 years ago
- Publicly releasable baselines for the Retro contest☆129Nov 22, 2018Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆206Nov 22, 2018Updated 7 years ago