Click here to Skip to main content
15,861,125 members
Articles / Artificial Intelligence / Machine Learning

Training a Humanoid AI Robot to Walk Using Proximal Policy Optimisation (PPO)

Rate me:
Please Sign up or sign in to vote.
5.00/5 (3 votes)
29 Sep 2020CPOL4 min read 8.3K   2  
In this article in the series we start to focus on one particular, more complex environment that PyBullet makes available: Humanoid, in which we must train a human-like agent to walk on two legs.
Here we are using the Proximal Policy Optimisation (PPO) algorithm. We look at: the history of the humanoid environment for reinforcement learning, an introduction to Proximal Policy Optimisation (PPO), and the particular learning parameters that we override.

Views

Daily Counts

This article is part of the series 'Teach a Robot to Walk Deep Reinforcement Learning View All

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Web Developer
United Kingdom United Kingdom
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions