mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆323 Updated 3 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users who are interested in learning-from-human-preferences are comparing it to the libraries listed below.
- A basic 2D maze environment where an agent starts from the top left corner and tries to find its way to the bottom left corner. ☆370 Updated last year
- ☆134 Updated 7 years ago
- A simple framework for experimenting with Reinforcement Learning in Python. ☆316 Updated last year
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code ☆691 Updated last year
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch ☆617 Updated 2 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆551 Updated 2 years ago
- World Models Experiments ☆650 Updated 2 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆203 Updated 4 years ago
- ☆303 Updated 2 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping ☆300 Updated last year
- Deep reinforcement learning model implementation in Tensorflow + OpenAI gym ☆297 Updated 2 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆361 Updated 5 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆427 Updated last year
- A customizable framework to create maze and gridworld environments ☆268 Updated 6 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆201 Updated 2 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning" ☆398 Updated last year
- For educational materials related to the spinning up workshops. ☆201 Updated 6 years ago
- Dream to Control: Learning Behaviors by Latent Imagination ☆545 Updated 3 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability ☆204 Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆501 Updated 2 years ago
- A Python interface for reinforcement learning environments ☆371 Updated 2 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems ☆573 Updated 2 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay" ☆190 Updated 6 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆496 Updated 2 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆203 Updated 6 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆371 Updated 3 years ago
- Building Agents with Imagination: pytorch step-by-step implementation ☆209 Updated 6 years ago
- Repo for reproduction of sequential social dilemmas ☆403 Updated 4 months ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL ☆639 Updated last year
- PlayGround: AI Research into Multi-Agent Learning. ☆772 Updated last year