Non-Slippery Frozen Lake with Actor-Critic

In this notebook we solve a non-slippery version of the FrozenLake-v0 environment using the TD actor critic algorithm with PPO policy updates. We’ll use a linear function approximator for our policy and our state value function.

To view the notebook in a new tab, click here. To interact with the notebook in Google Colab, hit the “Open in Colab” button below.

Open in Colab