mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆318Updated 3 years ago
Alternatives and similar repositories for learning-from-human-preferences:
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆546Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆536Updated 3 years ago
- World Models Experiments☆639Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆422Updated last year
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆472Updated last year
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆684Updated 11 months ago
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch☆608Updated 2 years ago
- Keeping track of RL experiments☆161Updated 2 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL☆627Updated 11 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆489Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization☆440Updated 6 years ago
- A Python interface for reinforcement learning environments☆364Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆487Updated 2 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆421Updated 2 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆201Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆446Updated last year
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Updated 4 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆370Updated 3 years ago
- List of competitions related to Reinforcement Learning☆351Updated last year
- Code for the paper "Generative Adversarial Imitation Learning"☆711Updated 6 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆366Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆200Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆315Updated 3 years ago
- Tools for accelerating safe exploration research.☆532Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆248Updated 3 years ago
- A simple framework for experimenting with Reinforcement Learning in Python.☆311Updated last year
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆695Updated 2 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch☆353Updated 6 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago
- A customizable framework to create maze and gridworld environments☆265Updated 6 years ago