Quadcopter neural controller for take-off and landing in windy environments

Olaz Moratinos, XabierAláez Gómez, DanielPrieto Míguez, ManuelVilladangos Alonso, JesúsAstrain Escola, José Javier2023-06-162023-09-01Olaz, X., Alaez, D., Prieto, M., Villadangos, J., Astrain, J. J. (2023) Quadcopter neural controller for take-off and landing in windy environments. Expert Systems with Applications, 225, 1-15. https://doi.org/10.1016/j.eswa.2023.120146.0957-417410.1016/j.eswa.2023.120146https://academica-e.unavarra.es/handle/2454/45521This paper proposes the design of a quadcopter neural controller based on Reinforcement Learning (RL) for controlling the complete maneuvers of landing and take-off, even in variable windy conditions. To facilitate RL training, a wind model is designed, and two RL algorithms, Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO), are adapted and compared. The first phases of the learning process consider extended exploration states as a warm-up, and a novel neural network controller architecture is proposed with the addition of an adaptation layer. The neural network’s output is defined as the forces and momentum desired for the UAV, and the adaptation layer transforms forces and momentum into motor velocities. By decoupling attitude from motor velocities, the adaptation layer enhances a more straightforward interpretation of the neural network output and helps refine the rewards. The successful neural controller training has been tested up to 36 km/h wind speed.application/pdfeng© 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license.QuadcopterTake-offLandingDeep reinforcement learningWindPPODDPGQuadcopter neural controller for take-off and landing in windy environmentsinfo:eu-repo/semantics/article2023-06-16info:eu-repo/semantics/openAccess