Reward Learning by Simulating the Past
☆46May 9, 2019Updated 6 years ago
Alternatives and similar repositories for rlsp
Users that are interested in rlsp are comparing it to the libraries listed below
Sorting:
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- NAIL is an agent that plays text-based interactive fiction games.☆47Jul 25, 2023Updated 2 years ago
- ☆17Sep 15, 2017Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18May 24, 2017Updated 8 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- Fully connected neural nets for supervised learning DQMC data☆12Jul 13, 2016Updated 9 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- ROS publisher for Kitti dataset☆12Nov 27, 2016Updated 9 years ago
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆28Jan 22, 2019Updated 7 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- Source code for the AnimalAI environment☆11Oct 1, 2019Updated 6 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- ☆28Mar 13, 2019Updated 6 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆117Dec 13, 2019Updated 6 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 7 years ago
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference☆23Jan 27, 2025Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14May 17, 2024Updated last year
- ☆14Aug 16, 2022Updated 3 years ago
- ☆14Jun 9, 2019Updated 6 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆19Oct 24, 2023Updated 2 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- web browser based rosbag manager☆16Sep 23, 2022Updated 3 years ago
- imperative programming in TensorFlow☆18Dec 12, 2016Updated 9 years ago
- HEBI ROS Examples/API/etc.☆19Aug 31, 2020Updated 5 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- This repository contains the source code used to produce the results presented in the paper "Near-deterministic production of universal …☆22Jul 10, 2019Updated 6 years ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆16Dec 20, 2018Updated 7 years ago
- ☆16Sep 20, 2016Updated 9 years ago
- ☆19Mar 22, 2023Updated 2 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- ☆20Sep 7, 2019Updated 6 years ago
- A collection of MuJoCo based environments.☆20Nov 30, 2020Updated 5 years ago