Non-Slippery Frozen Lake with REINFORCE¶
In this notebook we solve a non-slippery version of the FrozenLake-v0 environment using the REINFORCE algorithm (Monte Carlo policy gradient). We’ll use a linear function approximator for our policy.
To view the notebook in a new tab, click here. To interact with the notebook in Google Colab, hit the “Open in Colab” button below.