tbasaklar / PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm

A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preference space in a given domain.
25Updated last year

Related projects

Alternatives and complementary repositories for PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm