The flow chart of the PPO algorithm.

<div><p>In order to solve the problems of high dependence on the accuracy of environmental model and poor environmental adaptability of traditional control methods, the robot constant force grinding controller that based on proximal policy optimization was proposed. Training the controll...

Full description

Saved in:

Bibliographic Details
Main Author:	Qichao Wang (5132438) (author)
Other Authors:	Linlin Chen (84486) (author), Qun Sun (806350) (author), Chong Wang (120449) (author), Yanxia Wei (848911) (author)
Published:	2025
Subjects:	Biotechnology Ecology Cancer Science Policy Space Science Environmental Sciences not elsewhere classified Biological Sciences not elsewhere classified Mathematical Sciences not elsewhere classified Information Systems not elsewhere classified traditional control methods proximal policy optimization perceivable force information simulation results demonstrate poor environmental adaptability grinding force difference controller trained using simulation model grinding robot environmental model xlink "> high dependence environment model controller model
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	<div><p>In order to solve the problems of high dependence on the accuracy of environmental model and poor environmental adaptability of traditional control methods, the robot constant force grinding controller that based on proximal policy optimization was proposed. Training the controller model between grinding force difference and end-effector compensation displacement using the proximal policy optimization algorithm. Complete compensation using robot inverse kinematics. In order to validate the algorithm, a simulation model of the grinding robot with perceivable force information is established. The simulation results demonstrate that the controller trained using this algorithm can achieve constant force grinding without setting up the environment model in advance and has some environmental adaptability.</p></div>

The flow chart of the PPO algorithm.

Similar Items