Hu Duoxiu, Dong Wenhan, Xie Wujie, He Lei The Korean Society for Aeronautical and Space Sciences 2022 International Journal of Aeronautical and Space Sc Vol.23 No.2
A Markov decision process model with two stages, long-distance autonomous guidance and short-distance autonomous tracking with obstacle avoidance, was developed in this study to address the problem of multi-rotor unmanned aerial vehicles (UAVs) tracking dynamic ground targets. On this basis, an improved proximal policy optimization (PPO) algorithm is proposed. The proposed algorithm uses a long short-term memory (LSTM) network to calculate reward values, update network parameters, and perform adaptive optimization iterations from state information such as the real-time relative position of the UAV and the target, taking into account both the time-sequential data received from the UAV and the environmental context information. Finally, simulation experiments were performed on a platform based on a robot control system. The results showed that the proposed method can safely and effectively achieve autonomous maneuvering throughout the entire reconnaissance mission. Compared with the traditional PPO algorithm, introducing the LSTM network shortened the model training time, considerably improved the efficiency of tracking and obstacle avoidance, and further strengthened the robustness, accuracy, and real-time performance of the algorithm.
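For readers unfamiliar with the baseline, the following is a minimal sketch of the standard PPO clipped surrogate loss that the abstract refers to as the traditional PPO algorithm. It is not the authors' improved method: the LSTM component and the two-stage reward design are not reproduced here. The function name `ppo_clip_loss`, its parameters, and the default `clip_eps=0.2` are illustrative assumptions.

```python
import math


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Negative mean of the standard PPO clipped surrogate objective.

    Hypothetical helper for illustration only; inputs are equal-length
    sequences of per-sample log-probabilities and advantage estimates.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated policy and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to the interval [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic (lower) bound of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimizing the loss maximizes the surrogate objective.
    return -total / len(advantages)
```

Clipping limits how much credit a single update can take for a positive advantage, while the outer minimum leaves the penalty for a negative advantage unclipped. This keeps each policy update close to the policy that collected the data.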