HumanCompatibleAI/rlsp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HumanCompatibleAI/rlsp)

HumanCompatibleAI / rlsp

Reward Learning by Simulating the Past

☆46

Alternatives and similar repositories for rlsp

Users that are interested in rlsp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wearepal / EthicML
View on GitHub
Package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency
☆24Apr 5, 2026Updated 3 months ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
microsoft / nail_agent
View on GitHub
NAIL is an agent that plays text-based interactive fiction games.
☆47Jul 25, 2023Updated 2 years ago
thunder112358 / disentanglement-representation-using-vaes
View on GitHub
☆17Oct 13, 2019Updated 6 years ago
HumanCompatibleAI / interpreting-rewards
View on GitHub
Experiments in applying interpretability techniques to learned reward functions.
☆10Dec 11, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
jbkjr / train-procgen-pytorch
View on GitHub
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14May 17, 2024Updated 2 years ago
mansimov / acktr
View on GitHub
☆17Sep 15, 2017Updated 8 years ago
alexander-turner / attainable-utility-preservation
View on GitHub
☆11Jun 2, 2021Updated 5 years ago
HumanCompatibleAI / learning_biases
View on GitHub
Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.
☆25Sep 26, 2020Updated 5 years ago
arayabrain / FewshotClassifier
View on GitHub
Code for the blog post on few-shot classification via task representation and communication.
☆18May 24, 2017Updated 9 years ago
fair-preprocessing / nips2017
View on GitHub
☆26Nov 2, 2017Updated 8 years ago
mariacer / strong_dfc
View on GitHub
Minimizing Control for Credit Assignment with Strong Feedback
☆14Nov 3, 2024Updated last year
PR2 / pr2_apps
View on GitHub
☆14Aug 16, 2022Updated 3 years ago
machine-intelligence / rl-teacher-atari
View on GitHub
(This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…
☆29Jan 22, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
VITA-Group / DiffSES
View on GitHub
[TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…
☆18Jan 4, 2023Updated 3 years ago
voiler / unreal
View on GitHub
Reinforcement learning with unsupervised auxiliary tasks
☆22Jan 10, 2019Updated 7 years ago
tomas789 / kitti_player
View on GitHub
ROS publisher for Kitti dataset
☆12Nov 27, 2016Updated 9 years ago
Jamesken23 / Mechine-Learning
View on GitHub
☆26Feb 19, 2020Updated 6 years ago
ofirpress / PartialShuffle
View on GitHub
☆14Jun 9, 2019Updated 7 years ago
Algorithmic-Alignment-Lab / contracts
View on GitHub
Formal Contracts for Multi-Agent Reinforcement Learning
☆20Oct 24, 2023Updated 2 years ago
aypan17 / reward-misspecification
View on GitHub
☆10Mar 13, 2023Updated 3 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
ganow / keras-information-dropout
View on GitHub
Keras implementation of the Information Dropout (arXiv:1611.01353) paper
☆15Dec 31, 2016Updated 9 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Wenxuan-Zhou / EPI
View on GitHub
Code for Environment Probing Interaction Policies [ICLR 2019]
☆30Jun 17, 2019Updated 7 years ago
arushijain94 / SafeOptionCritic
View on GitHub
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆21Dec 16, 2018Updated 7 years ago
Kaixhin / GUDRL
View on GitHub
Generalised UDRL
☆37May 12, 2022Updated 4 years ago
dan-f / mark
View on GitHub
A small bookmarks app for Solid
☆11Jul 13, 2017Updated 9 years ago
pathak22 / modular-assemblies
View on GitHub
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
☆120Dec 13, 2019Updated 6 years ago
PartnershipOnAI / safelife
View on GitHub
SafeLife: safety benchmarks for reinforcement learning agents
☆61May 13, 2021Updated 5 years ago
anch3or / Optimization-Notes
View on GitHub
《最优化导论》第1 2 3 4 5 6 7 8 9 10 11 13 20 21 22 23章LaTeX公式笔记
☆41Dec 5, 2018Updated 7 years ago
mweiss17 / SEVN
View on GitHub
An outdoor environment simulator with real-world imagery for Deep Reinforcement Learning on navigation tasks.
☆30Apr 11, 2023Updated 3 years ago
dilinwang820 / adaptive-f-divergence
View on GitHub
A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"
☆20Jan 11, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TheShadow29 / infnet-spen
View on GitHub
TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"
☆30Jun 10, 2018Updated 8 years ago
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
bojone / vib
View on GitHub
Variational Information Bottleneck
☆16Nov 26, 2018Updated 7 years ago
procopiostein / ompl_planner_base
View on GitHub
ROS OMPL base planner
☆14Feb 4, 2016Updated 10 years ago
beyretb / AnimalAI-Environment
View on GitHub
Source code for the AnimalAI environment
☆11Oct 1, 2019Updated 6 years ago
avisingh599 / reward-learning-rl
View on GitHub
[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
☆374Nov 22, 2022Updated 3 years ago
rbondesan / CanonicalFlows
View on GitHub
Canonical normalizing flows
☆10Apr 30, 2019Updated 7 years ago