HumanCompatibleAI / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆27Updated 3 years ago
Alternatives and similar repositories for learning-from-human-preferences:
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- A Library for Active Preference-based Reward Learning Algorithms☆49Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆40Updated 2 years ago
- ☆45Updated last month
- Source files to replicate experiments in my ICLR 2022 paper.☆67Updated 7 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- behavior cloning from observation☆35Updated 4 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆37Updated 11 months ago
- ☆33Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Implementations of SAILR, PDO, and CSC☆31Updated 7 months ago
- ☆46Updated 2 years ago
- SocialGym 2: A lightweight benchmark and simulator for multi-robot social navigation using ROS and the OpenAI gym.☆54Updated 10 months ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆55Updated last year
- PyTorch implementation of GAIL and PPO reinforcement learning algorithms☆23Updated 3 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 2 months ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆21Updated 3 years ago
- 🔥 Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆21Updated this week
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆44Updated last year
- ☆53Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆39Updated 3 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆33Updated 2 years ago
- ☆25Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆36Updated 2 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆31Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆69Updated 2 weeks ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆80Updated last year
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆26Updated 11 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆64Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆84Updated 2 years ago