mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆317Updated 3 years ago
Alternatives and similar repositories for learning-from-human-preferences:
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆675Updated 10 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆196Updated 2 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆549Updated last year
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆369Updated 3 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆484Updated 2 years ago
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch☆600Updated 2 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch☆351Updated 5 years ago
- A collection of multi agent environments based on OpenAI gym.☆595Updated 8 months ago
- Reinforcement Learning with Deep Energy-Based Policies☆419Updated last year
- Clone of OpenAI's Spinning Up in PyTorch☆148Updated 2 years ago
- Structural implementation of RL key algorithms☆512Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆487Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆259Updated last year
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,000Updated 6 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆325Updated 5 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆361Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆526Updated 3 years ago
- ☆134Updated 7 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL☆621Updated 10 months ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆468Updated 11 months ago
- Actor-critic with experience replay☆252Updated 2 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆422Updated 3 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆236Updated 2 years ago
- Implementation of Meta-RL A3C algorithm☆402Updated 8 years ago
- List of competitions related to Reinforcement Learning☆352Updated last year
- ☆303Updated last year
- A Python interface for reinforcement learning environments☆363Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆203Updated last year
- A high-performance Atari A3C agent in 180 lines of PyTorch☆171Updated 3 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆207Updated 6 years ago