Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆335Nov 29, 2021Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Learning From Human Preferences - Tensorflow+Keras Implementation☆18Aug 17, 2017Updated 8 years ago
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback☆562Jan 24, 2023Updated 3 years ago
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆29Jan 22, 2019Updated 7 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Sep 1, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Dec 13, 2019Updated 6 years ago
- Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - 2009)☆21May 6, 2020Updated 5 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- ☆53Nov 10, 2022Updated 3 years ago
- Easy TensorFlow logging for quick prototypes☆110Oct 20, 2021Updated 4 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆134Nov 3, 2021Updated 4 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- A Library for Active Preference-based Reward Learning Algorithms☆53Dec 16, 2023Updated 2 years ago
- ☆86Apr 10, 2021Updated 4 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…☆154Oct 25, 2021Updated 4 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,122Oct 13, 2017Updated 8 years ago
- Library to compare and evaluate reward functions☆68Oct 23, 2023Updated 2 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- A new model-based algorithm for offline inverse reinforcement learning☆15Feb 20, 2023Updated 3 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆265Apr 14, 2020Updated 5 years ago
- Simple tools for statistical analyses in RL experiments☆67Jun 21, 2018Updated 7 years ago
- Publicly releasable baselines for the Retro contest☆130Nov 22, 2018Updated 7 years ago
- A fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆4,324Sep 4, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆22Aug 1, 2021Updated 4 years ago
- Clean PyTorch implementations of imitation and reward learning algorithms☆1,711Jan 7, 2025Updated last year
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 6 years ago
- StarCraft: BroodWars OpenAI Gym environment☆84Jan 8, 2019Updated 7 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆777Dec 22, 2023Updated 2 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,050Jun 10, 2023Updated 2 years ago
- Bayesian Inverse Reinforcement Learning with simple environments☆19May 17, 2022Updated 3 years ago