Olaz Moratinos, XabierAláez Gómez, DanielPrieto Míguez, ManuelVilladangos Alonso, JesúsAstrain Escola, José Javier2023-06-162023Olaz, X., Alaez, D., Prieto, M., Villadangos, J., Astrain, J. J. (2023) Quadcopter neural controller for take-off and landing in windy environments. Expert Systems with Applications, 225, 1-15. https://doi.org/10.1016/j.eswa.2023.120146.0957-417410.1016/j.eswa.2023.120146https://academica-e.unavarra.es/handle/2454/45521This paper proposes the design of a quadcopter neural controller based on Reinforcement Learning (RL) for controlling the complete maneuvers of landing and take-off, even in variable windy conditions. To facilitate RL training, a wind model is designed, and two RL algorithms, Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO), are adapted and compared. The first phases of the learning process consider extended exploration states as a warm-up, and a novel neural network controller architecture is proposed with the addition of an adaptation layer. The neural network’s output is defined as the forces and momentum desired for the UAV, and the adaptation layer transforms forces and momentum into motor velocities. By decoupling attitude from motor velocities, the adaptation layer enhances a more straightforward interpretation of the neural network output and helps refine the rewards. The successful neural controller training has been tested up to 36 km/h wind speed.application/pdfeng© 2023 Elsevier Ltd. This manuscript version is made available under the CC-BY-NC-ND 4.0QuadcopterTake-offLandingDeep reinforcement learningWindPPODDPGQuadcopter neural controller for take-off and landing in windy environmentsArtículo / Artikulua2023-06-16Acceso embargado / Sarbidea bahitua dagoinfo:eu-repo/semantics/embargoedAccess