Real-time object geopositioning from monocular target detection/tracking for aerial cinematography

Date

2023-12-08

Authors

Mygdalis, Vasileios
Pitas, Ioannis

Director

Publisher

IEEE
Acceso abierto / Sarbide irekia
Contribución a congreso / Biltzarrerako ekarpena
Versión aceptada / Onetsi den bertsioa

Project identifier

  • European Commission/Horizon 2020 Framework Programme/951911/ openaire
  • European Commission/Horizon 2020 Framework Programme/871479/ openaire
Impacto
Plum Print visual indicator of research metrics
  • Citations
    • Policy Citations: 1
    • Citation Indexes: 1
see details
OpenAlexGoogle Scholar
cited by count

Abstract

In recent years, the field of automated aerial cinematography has seen a significant increase in demand for real-time 3D target geopositioning for motion and shot planning. To this end, many of the existing cinematography plans require the use of complex sensors that need to be equipped on the subject or rely on external motion systems. This work addresses this problem by combining monocular visual target detection and tracking with a simple ground intersection model. Under the assumption that the targets to be filmed typically stand on the ground, 3D target localization is achieved by estimating the direction and the norm of the look-at vector. The proposed algorithm employs an error estimation model that accounts for the error in detecting the bounding box, the height estimation errors, and the uncertainties of the pitch and yaw angles. This algorithm has been fully implemented in a heavy-lifting aerial cinematography hexacopter, and its performance has been evaluated through experimental flights. Results show that typical errors are within 5 meters of absolute distance and 3 degrees of angular error for distances to the target of around 100 meters.

Description

Keywords

3D geopositioning, Aerial cinematography, Monocular, Target detection

Department

Estadística, Informática y Matemáticas / Estatistika, Informatika eta Matematika

Faculty/School

Degree

Doctorate program

item.page.cita

Alaez, D., Mygdalis, V., Villadangos, J., Pitas, I. (2023) Real-time object geopositioning from monocular target detection/tracking for aerial cinematography. In [IEEE], 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP) (pp. 1-6). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/MMSP59012.2023.10337638.

item.page.rights

© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other work.

Los documentos de Academica-e están protegidos por derechos de autor con todos los derechos reservados, a no ser que se indique lo contrario.