1 parent 3de843f commit 3c50615
1 file changed
README.md
@@ -24,6 +24,7 @@ PPO and PPO_CNN agents playing Pong-v0 game:<br>
 
 10. [Introduction to RL Asynchronous Advanced Actor Critic algorythm (A3C)](https://pylessons.com/A3C-reinforcement-learning/)
 
-11. [Introduction to RL Proximal Policy Optimization algorythm (PPO)](https://pylessons.com/PPO-reinforcement-learning/)<br>
+11. [Introduction to RL Proximal Policy Optimization algorythm (PPO)](https://pylessons.com/PPO-reinforcement-learning/)
+<br><br>
 PPO Pong-v0 Learning curve:
 <img src="11_Pong-v0_PPO/Pong-v0_APPO_0.0001_RMSprop.png" data-canonical-src="11_Pong-v0_PPO/Pong-v0_APPO_0.0001_RMSprop.png" width="500" height="300" />