Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31 Jul 27, 2021 Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below.
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆336 Nov 29, 2021 Updated 4 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters. ☆25 Sep 26, 2020 Updated 5 years ago
- ☆13 May 4, 2023 Updated 3 years ago
- Generalized Continuous Collision Detection Framework of Polynomial Trajectory ☆19 Jan 28, 2023 Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆34 Dec 14, 2023 Updated 2 years ago
- A new model-based algorithm for offline inverse reinforcement learning ☆15 Feb 20, 2023 Updated 3 years ago
- [IEEE IOT-J] Official Repository for The Paper, CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network ☆20 Sep 29, 2025 Updated 7 months ago
- A Decision Transformer for solving optimal EV charging problems using offline data. ☆18 Jan 19, 2026 Updated 4 months ago
- [NeurIPS 2024] Official Implementation of Meta-DT ☆56 Oct 16, 2024 Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making" ☆12 Apr 22, 2024 Updated 2 years ago
- A customized docker for headless GPU rendering without host-side configuration ☆11 Aug 22, 2022 Updated 3 years ago
- A simple 1d simulator for the "Neural-Lander" paper, ICRA 2019 ☆21 Feb 18, 2023 Updated 3 years ago
- Bayesian Inverse Reinforcement Learning with simple environments ☆19 May 17, 2022 Updated 4 years ago
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games" ☆12 Jul 15, 2024 Updated last year
- [NeurIPS 2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implementation ☆12 Feb 19, 2024 Updated 2 years ago
- [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning ☆18 May 21, 2025 Updated last year
- ☆15 Sep 4, 2025 Updated 8 months ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆22 Aug 1, 2021 Updated 4 years ago
- Tools to Support OpenAtlas development ☆13 Jul 9, 2019 Updated 6 years ago
- ☆12 Feb 21, 2025 Updated last year
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain ☆11 Jul 14, 2020 Updated 5 years ago
- ☆37 Apr 27, 2023 Updated 3 years ago
- Implementing REINFORCE algorithm on Pong, Lunar Lander and Cartpole + Medium Article ☆23 Nov 24, 2020 Updated 5 years ago
- Reinforcement Learning Algorithms with Unity 3D Environments ☆18 Jul 15, 2019 Updated 6 years ago
- Reward Learning by Simulating the Past ☆46 May 9, 2019 Updated 7 years ago
- [ICANN 2022] "An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection" Official Code ☆10 Feb 27, 2024 Updated 2 years ago
- Code for ThriftyDAgger ☆14 Dec 29, 2021 Updated 4 years ago
- Code for Deep RL from Human Preferences [Christiano et al.]. Plus a webapp for collecting human feedback ☆561 Jan 24, 2023 Updated 3 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces ☆11 Nov 4, 2021 Updated 4 years ago
- ☆18 May 30, 2023 Updated 2 years ago
- Code and Experiments for L4DC 2021 Paper: "Learning Visually Guided Latent Actions for Assistive Teleoperation" ☆13 May 4, 2021 Updated 5 years ago
- Literature and code for inverse reinforcement learning research ☆31 Mar 6, 2020 Updated 6 years ago
- [TMLR 2025] A collection of research papers on constraint inference within the field of RL ☆11 May 9, 2025 Updated last year
- ☆13 Feb 5, 2024 Updated 2 years ago
- Common Bilibili API calls. ☆17 Aug 5, 2023 Updated 2 years ago
- Official open-source implementation of ICML 2022 paper: Reachability Constrained Reinforcement Learning. ☆42 Jul 28, 2022 Updated 3 years ago
- A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observatio… ☆14 Sep 22, 2021 Updated 4 years ago
- Multi-agent car parking using reinforcement learning ☆12 Aug 4, 2024 Updated last year
- ☆13 Dec 3, 2023 Updated 2 years ago