Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot Learning (CoRL), Zurich, Switzerland, Oct. 2018.
☆30 · May 29, 2019 · Updated 6 years ago
Alternatives and similar repositories for batch-active-preference-based-learning
Users that are interested in batch-active-preference-based-learning are comparing it to the libraries listed below.
- Companion code to CoRL 2019 paper: E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh. "Asking Easy Questions: A User-Friendly Approach to… ☆18 · Oct 13, 2020 · Updated 5 years ago
- Companion code for RSS 2020 paper: "Active Preference-Based Gaussian Process Regression for Reward Learning" ☆39 · Mar 24, 2024 · Updated 2 years ago
- ☆37 · Nov 10, 2016 · Updated 9 years ago
- A Library for Active Preference-based Reward Learning Algorithms ☆55 · Dec 16, 2023 · Updated 2 years ago
- A re-implementation of the Pommerman environment in C++ ☆11 · Oct 6, 2021 · Updated 4 years ago
- ☆20 · Jan 31, 2018 · Updated 8 years ago
- Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021). ☆13 · Nov 16, 2021 · Updated 4 years ago
- Code for Contrastive Preference Learning (CPL) ☆181 · Nov 22, 2024 · Updated last year
- Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr… ☆15 · Jul 25, 2024 · Updated last year
- ☆41 · Oct 18, 2018 · Updated 7 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 · Dec 13, 2019 · Updated 6 years ago
- ☆10 · Mar 13, 2023 · Updated 3 years ago
- Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021. ☆13 · Nov 4, 2021 · Updated 4 years ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm ☆13 · Feb 28, 2018 · Updated 8 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018) ☆13 · Oct 8, 2018 · Updated 7 years ago
- ☆37 · Apr 27, 2023 · Updated 3 years ago
- ☆21 · Aug 14, 2017 · Updated 8 years ago
- Original code for the paper "Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping" by Mezghani et al. ☆18 · Jun 8, 2023 · Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning" contains scripts to reproduce experiments. ☆136 · Nov 3, 2021 · Updated 4 years ago
- ☆19 · Feb 11, 2022 · Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning ☆30 · Sep 30, 2022 · Updated 3 years ago
- ☆16 · Oct 22, 2019 · Updated 6 years ago
- ☆21 · Dec 17, 2020 · Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆27 · Jun 20, 2019 · Updated 6 years ago
- ☆17 · Jul 29, 2020 · Updated 5 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning ☆10 · Dec 8, 2022 · Updated 3 years ago
- This is an example of the design-by-contract method ☆14 · Dec 27, 2022 · Updated 3 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices ☆10 · Jan 12, 2021 · Updated 5 years ago
- Reward Learning by Simulating the Past ☆46 · May 9, 2019 · Updated 7 years ago
- Companion Codebase for "No, to the Right – Online Language Corrections for Robotic Manipulation via Shared Autonomy" ☆28 · Dec 13, 2022 · Updated 3 years ago
- A lightweight research framework ☆28 · Oct 14, 2025 · Updated 7 months ago
- Tensorflow implementation of ResNet ☆14 · Jul 22, 2017 · Updated 8 years ago
- Code for the PAC-Bayes Control paper. ☆13 · May 23, 2023 · Updated 3 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 3 years ago
- The Multiagent Decision Process (MADP) Toolbox - planning and learning in multiagent systems. ☆85 · Oct 14, 2020 · Updated 5 years ago
- Implementation of the most important parts of the Lottery Ticket Hypothesis Paper ☆12 · Jul 2, 2018 · Updated 7 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces" ☆19 · Oct 26, 2018 · Updated 7 years ago
- Model Primitive Hierarchical Reinforcement Learning ☆13 · Dec 8, 2022 · Updated 3 years ago
- Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors ☆13 · Aug 11, 2024 · Updated last year