HumanCompatibleAI / seals

Benchmark environments for reward modelling and imitation learning algorithms.
☆44 · Updated last year

Related projects: