mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆307 Updated 2 years ago
Related projects
Alternatives and complementary repositories for learning-from-human-preferences
- ☆305 Updated last year
- Reinforcement Learning with Deep Energy-Based Policies ☆416 Updated 11 months ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆365 Updated 3 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆197 Updated 3 years ago
- Building Agents with Imagination: PyTorch step-by-step implementation ☆206 Updated 5 years ago
- ☆128 Updated 6 years ago
- Educational materials related to the Spinning Up workshops. ☆193 Updated 5 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me… ☆232 Updated 2 years ago
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in PyTorch ☆573 Updated 2 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆534 Updated last year
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆473 Updated last year
- Code for the paper "Generative Adversarial Imitation Learning" ☆691 Updated 6 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆171 Updated 3 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆360 Updated 4 years ago
- TensorFlow/Keras code and trained models for Episodic Curiosity Through Reachability ☆199 Updated 4 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th… ☆201 Updated 4 years ago
- Code for the paper "Phasic Policy Gradient" ☆252 Updated last year
- A simple framework for experimenting with Reinforcement Learning in Python. ☆272 Updated 8 months ago
- A continuous action space version of A3C LSTM in PyTorch plus A3G design ☆258 Updated last month
- A customizable framework to create maze and gridworld environments ☆260 Updated 5 years ago
- Deep reinforcement learning model implementation in TensorFlow + OpenAI Gym ☆287 Updated last year
- Recurrent and multi-process PyTorch implementation of the deep reinforcement learning Actor-Critic algorithms A2C and PPO ☆192 Updated 2 years ago
- RL starter files to immediately train, visualize and evaluate an agent without writing a single line of code ☆654 Updated 6 months ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI Gym environments ☆296 Updated 3 years ago
- List of competitions related to Reinforcement Learning ☆347 Updated 10 months ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow. Deep MaxEnt, MaxEnt, LPIRL ☆590 Updated 6 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆479 Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination ☆514 Updated 3 years ago
- Random Network Distillation in PyTorch ☆242 Updated 5 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch ☆346 Updated 5 years ago