vub-ai-lab / bdpi

Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
25Updated 5 years ago

Related projects: