Gym environment
ΒΆ
Some examples are also provided with the Gym environment.
Here is the resulting policy for the mountain car example:

Here is the resulting policy for the pendulum example:

Gym environment
ΒΆSome examples are also provided with the Gym environment.
Here is the resulting policy for the mountain car example:
Here is the resulting policy for the pendulum example: