mrahtz / learning-from-human-preferencesLinks
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆320Updated 3 years ago
Alternatives and similar repositories for learning-from-human-preferences
Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below
Sorting:
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆203Updated 4 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆201Updated 2 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆687Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆550Updated last year
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Updated 5 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆425Updated last year
- List of competitions related to Reinforcement Learning☆349Updated last year
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago
- ☆134Updated 7 years ago
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch☆615Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆262Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆500Updated 2 years ago
- A Python interface for reinforcement learning environments☆368Updated 2 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆345Updated 5 years ago
- World Models Experiments☆649Updated 2 years ago
- ☆198Updated 2 years ago
- ☆303Updated 2 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆433Updated 2 years ago
- Real-World RL Benchmark Suite☆354Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆237Updated 2 years ago
- A PyTorch Platform for Distributed RL☆747Updated 3 years ago
- A simple framework for experimenting with Reinforcement Learning in Python.☆313Updated last year
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆370Updated 3 years ago
- For educational materials related to the spinning up workshops.☆200Updated 6 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆202Updated 5 years ago
- ICML 2018 Self-Imitation Learning☆278Updated 5 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆571Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆543Updated 3 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆474Updated last year