Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31 · Jul 27, 2021 · Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users who are interested in learning-from-human-preferences are comparing it to the libraries listed below.
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆333 · Nov 29, 2021 · Updated 4 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters. ☆25 · Sep 26, 2020 · Updated 5 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain ☆11 · Jul 14, 2020 · Updated 5 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Dec 14, 2023 · Updated 2 years ago
- ☆37 · Apr 27, 2023 · Updated 2 years ago
- A simple 1d simulator for the "Neural-Lander" paper, ICRA 2019 ☆19 · Feb 18, 2023 · Updated 3 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization ☆35 · Oct 22, 2020 · Updated 5 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le… ☆18 · Apr 13, 2021 · Updated 4 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning. ☆17 · Aug 2, 2018 · Updated 7 years ago
- ☆21 · Jun 27, 2024 · Updated last year
- Generalized Continuous Collision Detection Framework of Polynomial Trajectory ☆19 · Jan 28, 2023 · Updated 3 years ago
- Reward Learning by Simulating the Past ☆46 · May 9, 2019 · Updated 6 years ago
- Implementing REINFORCE algorithm on Pong, Lunar Lander and Cartplot + Medium Article ☆23 · Nov 24, 2020 · Updated 5 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021. ☆27 · May 4, 2021 · Updated 4 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP) ☆26 · Oct 11, 2022 · Updated 3 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL ☆24 · Nov 4, 2024 · Updated last year
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆22 · Aug 1, 2021 · Updated 4 years ago
- ☆53 · Nov 10, 2022 · Updated 3 years ago
- Library to compare and evaluate reward functions ☆67 · Oct 23, 2023 · Updated 2 years ago
- ☆31 · Feb 20, 2021 · Updated 5 years ago
- Disagreement-Regularized Imitation Learning ☆30 · May 25, 2021 · Updated 4 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L… ☆30 · May 29, 2019 · Updated 6 years ago
- Reinforcement learning environment for UR5e robot with OPENAI gym like format. Include both simulation and real parts. ☆14 · Nov 2, 2021 · Updated 4 years ago
- Implementations of Curious Replay for model-based adaptation. ☆43 · Jul 5, 2023 · Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Dec 11, 2021 · Updated 4 years ago
- Reinforcement learning benchmarking. ☆39 · Oct 22, 2018 · Updated 7 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- This is an unofficial python implement of nonlinear model predictive control with control Lyapunov functions and control barrier function… ☆39 · Aug 25, 2021 · Updated 4 years ago
- research - multi-agent car parking using reinforcement learning ☆12 · Aug 4, 2024 · Updated last year
- A multifunctional car using Openmv4 and Arduino, including Machine vision, Socket communication,WiFi graph transmission ☆12 · Nov 25, 2020 · Updated 5 years ago
- Heatmap-based Out-of-Distribution Detection (WACV 2023) ☆13 · Mar 27, 2024 · Updated last year
- Project for Elective in Robotics: Control of Multi-robot system, Univ. La Sapienza Roma, 2020. ☆11 · Jan 25, 2021 · Updated 5 years ago
- A generic tensorflow library for robotics: a bridge between robotics problem and modern machine learning architecture. Provides forward k… ☆13 · Apr 12, 2024 · Updated last year
- ☆10 · Feb 9, 2024 · Updated 2 years ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code ☆10 · Feb 27, 2024 · Updated 2 years ago
- A mouse brain histology tool for neuroscientists. ☆13 · Feb 16, 2026 · Updated last week
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms ☆39 · Dec 27, 2022 · Updated 3 years ago
- Source codes of Learning Causal Representations for Robust Domain Adaptation (IEEE TKDE) ☆12 · Feb 14, 2022 · Updated 4 years ago
- [ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot ☆46 · May 19, 2023 · Updated 2 years ago