ZachisGit / LearningFromHumanPreferencesLinks

Learning From Human Preferences - Tensorflow+Keras Implementation

☆18

Alternatives and similar repositories for LearningFromHumanPreferences

Users that are interested in LearningFromHumanPreferences are comparing it to the libraries listed below

Sorting:

zuoxingdong / dm2gym
Convert DeepMind Control Suite to OpenAI gym environments.
☆86Updated 5 years ago
BerkeleyAutomation / DART
☆49Updated 5 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆29Updated 7 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 7 years ago
kindredresearch / arp
Autoregressive policies for continuous control reinforcement learning
☆32Updated 6 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87Updated 6 years ago
hiwonjoon / ICML2019-TREX
☆83Updated 4 years ago
aravindsrinivas / upn
☆33Updated 7 years ago
dibyaghosh / dnc
Code for "Divide-and-Conquer Reinforcement Learning"
☆61Updated 6 years ago
sharadmv / parasol
☆68Updated 3 years ago
AIcrowd / real_robots
Gym environments for Robots that learn to interact with the environment autonomously
☆34Updated 2 years ago
marcino239 / pilco
Using Pilco algorithm to find a controller for few robotic problems
☆43Updated 9 years ago
GOAL-Robots / REALCompetitionStartingKit
☆35Updated 5 years ago
brain-research / LeaveNoTrace
Leave No Trace is an algorithm for safe reinforcement learning.
☆15Updated 7 years ago
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆79Updated 5 years ago
qxcv / magical
The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
☆77Updated last year
rddy / deepassist
Shared autonomy via deep reinforcement learning
☆78Updated 2 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
VincentYu68 / policy_transfer
☆28Updated 5 years ago
russellmendonca / GMPS
Guided-Meta Policy Search
☆39Updated 2 years ago
mcmachado / options
☆43Updated 8 years ago
vitchyr / viskit
rllab's viskit with some added features
☆73Updated 2 years ago
chloechsu / revisiting-ppo
☆47Updated 4 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69Updated 5 years ago
dannysdeng / dqn-pytorch
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Updated 5 years ago
stanford-iprl-lab / GRAC
implementation of our self-guided and self-regularized actor-critic algorithm
☆30Updated 2 years ago
TLESORT / State-Representation-Learning-An-Overview
Simplified version of "State Representation Learning for Control: An Overview" bibliography
☆34Updated 6 years ago
mcgillmrl / kusanagi
Library for model based RL in robotics
☆37Updated 6 years ago
paulorauber / hpg
Hindsight policy gradients
☆45Updated 5 years ago
edbeeching / 3d_control_deep_rl
Baselines and memory-based scenarios for the ViZDoom simulator
☆35Updated 2 years ago