Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31Jul 27, 2021Updated 4 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users who are interested in learning-from-human-preferences are comparing it to the libraries listed below.
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆337Nov 29, 2021Updated 4 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- ☆13May 4, 2023Updated 3 years ago
- Generalized Continuous Collision Detection Framework of Polynomial Trajectory☆19Jan 28, 2023Updated 3 years ago
- A new model-based algorithm for offline inverse reinforcement learning☆15Feb 20, 2023Updated 3 years ago
- A Decision Transformer for solving optimal EV charging problems using offline data.☆18Jan 19, 2026Updated 4 months ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆56Oct 16, 2024Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated 2 years ago
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- [NeurIPS 2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implementation☆12Feb 19, 2024Updated 2 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆22Aug 1, 2021Updated 4 years ago
- ☆12Feb 21, 2025Updated last year
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain☆11Jul 14, 2020Updated 5 years ago
- ☆10Jun 5, 2024Updated 2 years ago
- ☆10Sep 19, 2021Updated 4 years ago
- A multi-agent environment using Unity ML-Agents Toolkit☆10Dec 9, 2020Updated 5 years ago
- ☆37Apr 27, 2023Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 7 years ago
- [ICANN 2022] "An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection" Official Code☆10Feb 27, 2024Updated 2 years ago
- ROS driver for DJI/Ryze Tello drones☆10Jun 23, 2021Updated 4 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces☆11Nov 4, 2021Updated 4 years ago
- Implementation of the Advanced Encryption Standard (AES) block cipher☆13Jan 15, 2026Updated 4 months ago
- ☆18May 30, 2023Updated 3 years ago
- Code and Experiments for L4DC 2021 Paper: "Learning Visually Guided Latent Actions for Assistive Teleoperation"☆13May 4, 2021Updated 5 years ago
- REEF Estimator is an open source velocity and altitude estimator and controller for multi-rotors. This repo will help pull all the necess…☆11Oct 9, 2023Updated 2 years ago
- Literature and code for inverse reinforcement learning research☆31Mar 6, 2020Updated 6 years ago
- [TMLR 2025] A collection of research papers on constraint inference within the field of RL☆11May 9, 2025Updated last year
- News website template - fully responsive.☆10May 11, 2021Updated 5 years ago
- ☆14Feb 5, 2024Updated 2 years ago
- Commonly used Bilibili API calls.☆17Aug 5, 2023Updated 2 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environments.☆12Sep 3, 2019Updated 6 years ago
- Official open-source implementation of the ICML 2022 paper "Reachability Constrained Reinforcement Learning".☆42Jul 28, 2022Updated 3 years ago
- A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observatio…☆14Sep 22, 2021Updated 4 years ago
- Multi-agent car parking using reinforcement learning☆12Aug 4, 2024Updated last year
- ☆13Dec 3, 2023Updated 2 years ago
- ☆30Jan 27, 2025Updated last year
- Learning From Human Preferences - Tensorflow+Keras Implementation☆18Aug 17, 2017Updated 8 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Sep 1, 2022Updated 3 years ago