Reward Learning by Simulating the Past
☆46May 9, 2019Updated 6 years ago
Alternatives and similar repositories for rlsp
Users that are interested in rlsp are comparing it to the libraries listed below
Sorting:
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- Package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency☆24Feb 19, 2026Updated last month
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- NAIL is an agent that plays text-based interactive fiction games.☆47Jul 25, 2023Updated 2 years ago
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18May 24, 2017Updated 8 years ago
- ☆26Nov 2, 2017Updated 8 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆29Jan 22, 2019Updated 7 years ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆17Jan 4, 2023Updated 3 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆23Jan 10, 2019Updated 7 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- ROS publisher for Kitti dataset☆12Nov 27, 2016Updated 9 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆19Oct 24, 2023Updated 2 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- ☆11Mar 13, 2023Updated 3 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Dec 16, 2018Updated 7 years ago
- A small bookmarks app for Solid☆11Jul 13, 2017Updated 8 years ago
- Web effectivethesis.com (and old version of efektivni-altruismus.cz)☆10Feb 22, 2022Updated 4 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆117Dec 13, 2019Updated 6 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61May 13, 2021Updated 4 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Path integral quantum Monte Carlo☆27Mar 5, 2014Updated 12 years ago
- An outdoor environment simulator with real-world imagery for Deep Reinforcement Learning on navigation tasks.☆30Apr 11, 2023Updated 2 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Jan 11, 2019Updated 7 years ago
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Jun 10, 2018Updated 7 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago
- ☆20Apr 3, 2023Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- Source code for the AnimalAI environment☆11Oct 1, 2019Updated 6 years ago
- Canonical normalizing flows☆10Apr 30, 2019Updated 6 years ago