AaronJi / RL

A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
27Updated 3 years ago

Alternatives and similar repositories for RL:

Users that are interested in RL are comparing it to the libraries listed below