Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31 · Jul 27, 2021 · Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users who are interested in learning-from-human-preferences are comparing it to the libraries listed below.
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆335 · Nov 29, 2021 · Updated 4 years ago
- ☆13 · May 4, 2023 · Updated 2 years ago
- Generalized Continuous Collision Detection Framework of Polynomial Trajectory ☆19 · Jan 28, 2023 · Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Dec 14, 2023 · Updated 2 years ago
- A new model-based algorithm for offline inverse reinforcement learning ☆15 · Feb 20, 2023 · Updated 3 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT ☆53 · Oct 16, 2024 · Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making" ☆12 · Apr 22, 2024 · Updated last year
- ☆21 · Jun 27, 2024 · Updated last year
- Minimal example to apply Decision Transformer in Atari Pong ☆15 · Feb 1, 2025 · Updated last year
- [NeurIPS 2023] Official implementation of "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" ☆10 · Feb 19, 2024 · Updated 2 years ago
- [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning ☆18 · May 21, 2025 · Updated 10 months ago
- ☆15 · Sep 4, 2025 · Updated 6 months ago
- Bayesian active RL (BARL) and trajectory information planning (TIP) ☆26 · Oct 11, 2022 · Updated 3 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆22 · Aug 1, 2021 · Updated 4 years ago
- ☆13 · Feb 21, 2025 · Updated last year
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain ☆11 · Jul 14, 2020 · Updated 5 years ago
- ☆10 · Jun 5, 2024 · Updated last year
- ☆10 · Sep 19, 2021 · Updated 4 years ago
- ☆37 · Apr 27, 2023 · Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning. ☆17 · Aug 2, 2018 · Updated 7 years ago
- ☆10 · Feb 9, 2024 · Updated 2 years ago
- ☆10 · Jul 21, 2019 · Updated 6 years ago
- Code for ThriftyDAgger ☆14 · Dec 29, 2021 · Updated 4 years ago
- ROS driver for DJI/Ryze Tello drones ☆10 · Jun 23, 2021 · Updated 4 years ago
- Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback ☆562 · Jan 24, 2023 · Updated 3 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces ☆11 · Nov 4, 2021 · Updated 4 years ago
- Implementation of the Advanced Encryption Standard (AES) block cipher ☆12 · Jan 15, 2026 · Updated 2 months ago
- ☆18 · May 30, 2023 · Updated 2 years ago
- Code and Experiments for L4DC 2021 Paper: "Learning Visually Guided Latent Actions for Assistive Teleoperation" ☆14 · May 4, 2021 · Updated 4 years ago
- News website template - fully responsive. ☆10 · May 11, 2021 · Updated 4 years ago
- ☆16 · Jan 21, 2026 · Updated 2 months ago
- ☆13 · Feb 5, 2024 · Updated 2 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment. ☆12 · Sep 3, 2019 · Updated 6 years ago
- simulation/RL - multi-agent car parking using reinforcement learning ☆12 · Aug 4, 2024 · Updated last year
- ☆30 · Jan 27, 2025 · Updated last year
- ☆13 · Sep 19, 2023 · Updated 2 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences" ☆12 · May 20, 2019 · Updated 6 years ago
- ☆14 · Oct 11, 2022 · Updated 3 years ago
- Teleoperation of Franka FR3 with Spacemouse ☆27 · Jul 18, 2025 · Updated 8 months ago