Reward Learning by Simulating the Past
☆46May 9, 2019Updated 6 years ago
Alternatives and similar repositories for rlsp
Users that are interested in rlsp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- Package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency☆24Apr 5, 2026Updated 3 weeks ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- NAIL is an agent that plays text-based interactive fiction games.☆47Jul 25, 2023Updated 2 years ago
- ☆17Oct 13, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- ☆17Sep 15, 2017Updated 8 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- ☆26Nov 2, 2017Updated 8 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆29Jan 22, 2019Updated 7 years ago
- ☆14Aug 16, 2022Updated 3 years ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆17Jan 4, 2023Updated 3 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆23Jan 10, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Feb 19, 2020Updated 6 years ago
- ROS publisher for Kitti dataset☆12Nov 27, 2016Updated 9 years ago
- ☆14Jun 9, 2019Updated 6 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆11Feb 19, 2024Updated 2 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆20Oct 24, 2023Updated 2 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Dec 16, 2018Updated 7 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- A small bookmarks app for Solid☆11Jul 13, 2017Updated 8 years ago
- Web effectivethesis.com (and old version of efektivni-altruismus.cz)☆10Feb 22, 2022Updated 4 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆118Dec 13, 2019Updated 6 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61May 13, 2021Updated 4 years ago
- Source materials for CoinFT☆33Jan 23, 2026Updated 3 months ago
- 《最优化导论》第1 2 3 4 5 6 7 8 9 10 11 13 20 21 22 23章LaTeX公式笔记☆41Dec 5, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆13Nov 5, 2024Updated last year
- An outdoor environment simulator with real-world imagery for Deep Reinforcement Learning on navigation tasks.☆30Apr 11, 2023Updated 3 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Jan 11, 2019Updated 7 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- ☆20Apr 3, 2023Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆89Jul 9, 2020Updated 5 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago