PLE environmentΒΆ

This environment is an interface with the PLE environment. The provided example shows how to successfully learn a good policy on the simple "catcher" game in a few epochs (~10). You should easily be able to learn successful policies for all the games provided (possibly with some hyper-parameters tuning).