Reward Learning by Simulating the Past
☆46May 9, 2019Updated 7 years ago
Alternatives and similar repositories for rlsp
Users that are interested in rlsp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 6 years ago
- Package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency☆24Apr 5, 2026Updated 2 months ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- NAIL is an agent that plays text-based interactive fiction games.☆47Jul 25, 2023Updated 2 years ago
- ☆17Oct 13, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Sep 15, 2017Updated 8 years ago
- ☆11Jun 2, 2021Updated 5 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18May 24, 2017Updated 9 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆29Jan 22, 2019Updated 7 years ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆18Jan 4, 2023Updated 3 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆22Jan 10, 2019Updated 7 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- ROS publisher for Kitti dataset☆12Nov 27, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- ☆14Jun 9, 2019Updated 7 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆20Oct 24, 2023Updated 2 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆30Jun 17, 2019Updated 6 years ago
- Generalised UDRL☆37May 12, 2022Updated 4 years ago
- A small bookmarks app for Solid☆11Jul 13, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆119Dec 13, 2019Updated 6 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61May 13, 2021Updated 5 years ago
- Source materials for CoinFT☆34Jan 23, 2026Updated 4 months ago
- Path integral quantum Monte Carlo☆29Mar 5, 2014Updated 12 years ago
- ☆13Nov 5, 2024Updated last year
- An outdoor environment simulator with real-world imagery for Deep Reinforcement Learning on navigation tasks.☆30Apr 11, 2023Updated 3 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Jan 11, 2019Updated 7 years ago
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Jun 10, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 5 years ago
- ☆20Apr 3, 2023Updated 3 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference☆25Jan 27, 2025Updated last year
- Canonical normalizing flows☆10Apr 30, 2019Updated 7 years ago