Person: Pérez Goya, Unai
Loading...
Email Address
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
Pérez Goya
First Name
Unai
person.page.departamento
Estadística, Informática y Matemáticas
person.page.instituteName
InaMat2. Instituto de Investigación en Materiales Avanzados y Matemáticas
ORCID
0000-0002-2796-9079
person.page.upna
811058
Name
12 results
Search Results
Now showing 1 - 10 of 12
Publication Open Access Using RGISTools to estimate water levels in reservoirs and lakes(MDPI, 2020) Militino, Ana F.; Montesino San Martín, Manuel; Pérez Goya, Unai; Ugarte Martínez, María Dolores; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística, Informática y MatemáticasThe combination of freely accessible satellite imagery from multiple programs improves the spatio-temporal coverage of remote sensing data, but it exhibits barriers regarding the variety of web services, file formats, and data standards. Ris an open-source software environment with state-of-the-art statistical packages for the analysis of optical imagery. However, it lacks the tools for providing unified access to multi-program archives to customize and process the time series of images. This manuscript introduces RGISTools, a new software that solves these issues, and provides a working example on water mapping, which is a socially and environmentally relevant research field. The case study uses a digital elevation model and a rarely assessed combination of Landsat-8 and Sentinel-2 imagery to determine the water level of a reservoir in Northern Spain. The case study demonstrates how to acquire and process time series of surface reflectance data in an efficient manner. Our method achieves reasonably accurate results, with a root mean squared error of 0.90 m. Future improvements of the package involve the expansion of the workflow to cover the processing of radar images. This should counteract the limitation of the cloud coverage with multi-spectral images.Publication Open Access Improving the quality of satellite imagery based on ground-truth data from rain gauge stations(MDPI, 2018) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estatistika eta Ikerketa Operatiboa; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística e Investigación Operativa; Gobierno de Navarra / Nafarroako GobernuaMultitemporal imagery is by and large geometrically and radiometrically accurate, but the residual noise arising from removal clouds and other atmospheric and electronic effects can produce outliers that must be mitigated to properly exploit the remote sensing information. In this study, we show how ground-truth data from rain gauge stations can improve the quality of satellite imagery. To this end, a simulation study is conducted wherein different sizes of outlier outbreaks are spread and randomly introduced in the normalized difference vegetation index (NDVI) and the day and night land surface temperature (LST) of composite images from Navarre (Spain) between 2011 and 2015. To remove outliers, a new method called thin-plate splines with covariates (TpsWc) is proposed. This method consists of smoothing the median anomalies with a thin-plate spline model, whereby transformed ground-truth data are the external covariates of the model. The performance of the proposed method is measured with the square root of the mean square error (RMSE), calculated as the root of the pixel-by-pixel mean square differences between the original data and the predicted data with the TpsWc model and with a state-space model with and without covariates. The study shows that the use of ground-truth data reduces the RMSE in both the TpsWc model and the state-space model used for comparison purposes. The new method successfully removes the abnormal data while preserving the phenology of the raw data. The RMSE reduction percentage varies according to the derived variables (NDVI or LST), but reductions of up to 20% are achieved with the new proposal.Publication Open Access Detecting change-points in the time series of surfaces occupied by pre-defined NDVI categories in continental Spain from 1981 to 2015(Springer, 2018) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2The free access to satellite images since more than 40 years ago has provoked a rapid increase of multitemporal derived information of remote sensing data that should be summarized and analyzed for future inferences. In particular, the study of trends and trend changes is of crucial interest in many studies of phenology, climatology, agriculture, hydrology, geology or many other environmental disciplines. Overall, the normalized dierence vegetation index (NDVI), as a satellite derived variable, plays a crucial role because of its usefulness for vegetation and landscape characterization, land use and land cover mapping, environmental monitoring, climate change or crop prediction models. Since the eighties, it can be retrieved all over the world from dierent satellites. In this work we propose to analyze its temporal evolution, looking for breakpoints or change-points in trends of the surfaces occupied by four NDVI classications made in Spain from 1981 to 2015. The results show a decrease of bare soils and semi-bare soils starting in the middle nineties or before, and a slight increase of middle-vegetation and high-vegetation soils starting in 1990 and 2000 respectively.Publication Open Access Stochastic spatio-temporal models for analysing NDVI distribution of GIMMS NDVI3g images(MDPI, 2017) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estatistika eta Ikerketa Operatiboa; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística e Investigación Operativa; Gobierno de Navarra / Nafarroako Gobernua: Project PI015, 2016The normalized difference vegetation index (NDVI) is an important indicator for evaluating vegetation change, monitoring land surface fluxes or predicting crop models. Due to the great availability of images provided by different satellites in recent years, much attention has been devoted to testing trend changes with a time series of NDVI individual pixels. However, the spatial dependence inherent in these data is usually lost unless global scales are analyzed. In this paper, we propose incorporating both the spatial and the temporal dependence among pixels using a stochastic spatio-temporal model for estimating the NDVI distribution thoroughly. The stochastic model is a state-space model that uses meteorological data of the Climatic Research Unit (CRU TS3.10) as auxiliary information. The model will be estimated with the Expectation-Maximization (EM) algorithm. The result is a set of smoothed images providing an overall analysis of the NDVI distribution across space and time, where fluctuations generated by atmospheric disturbances, fire events, land-use/cover changes or engineering problems from image capture are treated as random fluctuations. The illustration is carried out with the third generation of NDVI images, termed NDVI3g, of the Global Inventory Modeling and Mapping Studies (GIMMS) in continental Spain. This data are taken in bymonthly periods from January 2011 to December 2013, but the model can be applied to many other variables, countries or regions with different resolutions.Publication Open Access Hierarchical spatio-temporal change-point detection(Taylor and Francis Group, 2023) Moradi, Mohammad Mehdi; Cronie, Ottmar; Pérez Goya, Unai; Mateu, Jorge; Estadística, Informática y Matemáticas; Estatistika, Informatika eta MatematikaDetecting change-points in multivariate settings is usually carried out by analyzing all marginals either independently, via univariate methods, or jointly, through multivariate approaches. The former discards any inherent dependencies between different marginals and the latter may suffer from domination/masking among different change-points of distinct marginals. As a remedy, we propose an approach which groups marginals with similar temporal behaviors, and then performs group-wise multivariate change-point detection. Our approach groups marginals based on hierarchical clustering using distances which adjust for inherent dependencies. Through a simulation study we show that our approach, by preventing domination/masking, significantly enhances the general performance of the employed multivariate change-point detection method. Finally, we apply our approach to two datasets: (i) Land Surface Temperature in Spain, during the years 2000–2021, and (ii) The WikiLeaks Afghan War Diary data.Publication Open Access Large-scale unsupervised spatio-temporal semantic analysis of vast regions from satellite images sequences(Springer, 2024) Echegoyen Arruti, Carlos; Pérez, Aritz; Santafé Rodrigo, Guzmán; Pérez Goya, Unai; Ugarte Martínez, María Dolores; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2; Universidad Pública de Navarra / Nafarroako Unibertsitate PublikoaTemporal sequences of satellite images constitute a highly valuable and abundant resource for analyzing regions of interest. However, the automatic acquisition of knowledge on a large scale is a challenging task due to different factors such as the lack of precise labeled data, the definition and variability of the terrain entities, or the inherent complexity of the images and their fusion. In this context, we present a fully unsupervised and general methodology to conduct spatio-temporal taxonomies of large regions from sequences of satellite images. Our approach relies on a combination of deep embeddings and time series clustering to capture the semantic properties of the ground and its evolution over time, providing a comprehensive understanding of the region of interest. The proposed method is enhanced by a novel procedure specifically devised to refine the embedding and exploit the underlying spatio-temporal patterns. We use this methodology to conduct an in-depth analysis of a 220 km region in northern Spain in different settings. The results provide a broad and intuitive perspective of the land where large areas are connected in a compact and well-structured manner, mainly based on climatic, phytological, and hydrological factors.Publication Open Access Logistic regression versus XGBoost for detecting burned areas using satellite images(Springer, 2024) Militino, Ana F.; Goyena Baroja, Harkaitz; Pérez Goya, Unai; Ugarte Martínez, María Dolores; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2; Universidad Pública de Navarra / Nafarroako Unibertsitate PublikoaClassical statistical methods prove advantageous for small datasets, whereas machine learning algorithms can excel with larger datasets. Our paper challenges this conventional wisdom by addressing a highly significant problem: the identification of burned areas through satellite imagery, that is a clear example of imbalanced data. The methods are illustrated in the North-Central Portugal and the North-West of Spain in October 2017 within a multi-temporal setting of satellite imagery. Daily satellite images are taken from Moderate Resolution Imaging Spectroradiometer (MODIS) products. Our analysis shows that a classical Logistic regression (LR) model competes on par, if not surpasses, a widely employed machine learning algorithm called the extreme gradient boosting algorithm (XGBoost) within this particular domain.Publication Open Access Unpaired spatio-temporal fusion of image patches (USTFIP) from cloud covered images(Elsevier, 2023) Goyena Baroja, Harkaitz; Pérez Goya, Unai; Montesino San Martín, Manuel; Militino, Ana F.; Wang, Qunming; Atkinson, Peter M.; Ugarte Martínez, María Dolores; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2Spatio-temporal image fusion aims to increase the frequency and resolution of multispectral satellite sensor images in a cost-effective manner. However, practical constraints on input data requirements and computational cost prevent a wider adoption of these methods in real case-studies. We propose an ensemble of strategies to eliminate the need for cloud-free matching pairs of satellite sensor images. The new methodology called Unpaired Spatio-Temporal Fusion of Image Patches (USTFIP) is tested in situations where classical requirements are progressively difficult to meet. Overall, the study shows that USTFIP reduces the root mean square error by 2-to-13% relative to the state-of-the-art Fit-FC fusion method, due to an efficient use of the available information. Implementation of USTFIP through parallel computing saves up to 40% of the computational time required for Fit-FC.Publication Open Access Machine learning procedures for daily interpolation of rainfall in Navarre (Spain)(Springer, 2023) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2Kriging is by far the most well known and widely used statistical method for interpolating data in spatial random fields. The main reason is that it provides the best linear unbiased predictor and it is an exact interpolator when normality is assumed. The robustness of this method allows small departures from normality, however, many meteorological, pollutant and environmental variables have extremely asymmetrical distributions and Kriging cannot be used. Machine learning techniques such as neural networks, random forest, and k-nearest neighbor can be used instead, because they do not require specific distributional assumptions. The drawback is that they do not take account of the spatial dependence, and for an optimal performance in spatial random fields more complex machine learning techniques could be considered. These techniques also require a relatively large amount of training data and they are computationally challenging to implement. For a reduced number of observations, we illustrate the performance of the aforementioned procedures using daily rainfall data of manual meteorological gauge stations in Navarre, where the only auxiliary variables available are the spatial coordinates and the altitude. The quality of the predictions is carefully checked through three versions of the relative root mean squared error (RRMSE). The conclusion is that when we cannot use Kriging, random forest and neural networks outperform k-nearest neighbor technique, and provide reliable predictions of rainfall daily data with scarce auxiliary information.Publication Open Access Interpolation of the mean anomalies for cloud filling in land surface temperature and normalized difference vegetation index(IEEE, 2019) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Genton, Marc G.; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística, Informática y MatemáticasWhen monitoring time series of remote sensing data, it is advisable to fill gaps, i.e., missing or distorted data, caused by atmospheric effects or technical failures. In this paper, a new method for filling these gaps called interpolation of the mean anomalies (IMA) is proposed and compared with some competitors. The method consists of: 1) defining a neighborhood for the target image from previous and subsequent images across previous and subsequent years; 2) computing the mean target image of the neighborhood; 3) estimating the anomalies in the target image by subtracting the mean image from the target image; 4) filtering the anomalies; 5) averaging the anomalies over a predefined window; 6) interpolating the averaged anomalies; and 7) adding the interpolated anomalies to the mean image. To assess the performance of the IMA method, both a real example and a simulation study are conducted with a time series of Moderate Resolution Imaging Spectroradiometer (MODIS) TERRA and MODIS AQUA images captured over the region of Navarre (Spain) from 2011 to 2013. We analyze the land surface temperature (LST) day and night, and the normalized difference vegetation index (NDVI). In the simulation study, seven sizes of artificial clouds are randomly introduced to each image in the studied time series. The square root of the mean-squared prediction error (RMSE) between the original and the filled data is chosen as an indicator of the goodness of fit. The results show that the IMA method outperforms Timesat, Hants, and Gapfill (GF) in filling small, moderate, and big cloud gaps in both the day and night LST and NDVI data, reaching RMSE reductions of up to 23%.