monoelh / deep-reinforcement-learning_DDQN_PPO_HER

MLP framework (pure NumPy) and DDQN framework for OpenAI Gym games. Also includes test code for PPO, a Hindsight Experience Replay (HER) bit-flip DQN example, and prioritized replay.
19 · Updated 6 years ago

Related projects: