Training a Humanoid AI Robot to Walk Using Proximal Policy Optimisation (PPO)

philoxenic

Rate me:

5.00/5 (3 votes)

29 Sep 2020CPOL4 min read

8.6K

In this article in the series we start to focus on one particular, more complex environment that PyBullet makes available: Humanoid, in which we must train a human-like agent to walk on two legs.

Here we are using the Proximal Policy Optimisation (PPO) algorithm. We look at: the history of the humanoid environment for reinforcement learning, an introduction to Proximal Policy Optimisation (PPO), and the particular learning parameters that we override.

Views

Daily Counts

This article is part of the series 'Teach a Robot to Walk Deep Reinforcement Learning ◁ Prev View All Next ▷

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Written By

philoxenic

Web Developer

United Kingdom

This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Training a Humanoid AI Robot to Walk Using Proximal Policy Optimisation (PPO)

Views

License

Comments and Discussions