Person:
Pérez Goya, Unai

Last Name

Pérez Goya

First Name

Unai

Department

Estadística, Informática y Matemáticas

Institute

InaMat2. Instituto de Investigación en Materiales Avanzados y Matemáticas

ORCID

0000-0002-2796-9079

UPNA ID

811058

Publications
  • Publication (Open Access)
    Machine learning procedures for daily interpolation of rainfall in Navarre (Spain)
    (Springer, 2023) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estadística, Informática y Matemáticas; Estatistika, Informatika eta Matematika; Institute for Advanced Materials and Mathematics - INAMAT2
    Kriging is by far the best-known and most widely used statistical method for interpolating data in spatial random fields. The main reason is that it provides the best linear unbiased predictor and it is an exact interpolator when normality is assumed. The method is robust to small departures from normality. However, many meteorological, pollutant and environmental variables have extremely asymmetrical distributions, and Kriging cannot be used for them. Machine learning techniques such as neural networks, random forest, and k-nearest neighbor can be used instead, because they do not require specific distributional assumptions. The drawback is that they do not account for spatial dependence, so optimal performance in spatial random fields may call for more complex machine learning techniques. These techniques also require a relatively large amount of training data and are computationally challenging to implement. For a reduced number of observations, we illustrate the performance of the aforementioned procedures using daily rainfall data from manual meteorological gauge stations in Navarre, where the only auxiliary variables available are the spatial coordinates and the altitude. The quality of the predictions is carefully checked through three versions of the relative root mean squared error (RRMSE). The conclusion is that when we cannot use Kriging, random forest and neural networks outperform the k-nearest neighbor technique and provide reliable predictions of daily rainfall data with scarce auxiliary information.
  • Publication (Open Access)
    Stochastic spatio-temporal models for analysing NDVI distribution of GIMMS NDVI3g images
    (MDPI, 2017) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estatistika eta Ikerketa Operatiboa; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística e Investigación Operativa; Gobierno de Navarra / Nafarroako Gobernua: Project PI015, 2016
    The normalized difference vegetation index (NDVI) is an important indicator for evaluating vegetation change, monitoring land surface fluxes or predicting crop models. Due to the great availability of images provided by different satellites in recent years, much attention has been devoted to testing trend changes with time series of individual NDVI pixels. However, the spatial dependence inherent in these data is usually lost unless global scales are analyzed. In this paper, we propose incorporating both the spatial and the temporal dependence among pixels using a stochastic spatio-temporal model for estimating the NDVI distribution thoroughly. The stochastic model is a state-space model that uses meteorological data from the Climatic Research Unit (CRU TS3.10) as auxiliary information. The model is estimated with the Expectation-Maximization (EM) algorithm. The result is a set of smoothed images providing an overall analysis of the NDVI distribution across space and time, where fluctuations generated by atmospheric disturbances, fire events, land-use/cover changes or engineering problems from image capture are treated as random fluctuations. The illustration is carried out with the third generation of NDVI images, termed NDVI3g, of the Global Inventory Modeling and Mapping Studies (GIMMS) in continental Spain. These data are taken in bimonthly periods from January 2011 to December 2013, but the model can be applied to many other variables, countries or regions with different resolutions.
  • Publication (Open Access)
    Improving the quality of satellite imagery based on ground-truth data from rain gauge stations
    (MDPI, 2018) Militino, Ana F.; Ugarte Martínez, María Dolores; Pérez Goya, Unai; Estatistika eta Ikerketa Operatiboa; Institute for Advanced Materials and Mathematics - INAMAT2; Estadística e Investigación Operativa; Gobierno de Navarra / Nafarroako Gobernua
    Multitemporal imagery is by and large geometrically and radiometrically accurate, but the residual noise arising from cloud removal and other atmospheric and electronic effects can produce outliers that must be mitigated to properly exploit the remote sensing information. In this study, we show how ground-truth data from rain gauge stations can improve the quality of satellite imagery. To this end, a simulation study is conducted wherein outlier outbreaks of different sizes are randomly introduced in the normalized difference vegetation index (NDVI) and the day and night land surface temperature (LST) of composite images from Navarre (Spain) between 2011 and 2015. To remove outliers, a new method called thin-plate splines with covariates (TpsWc) is proposed. This method consists of smoothing the median anomalies with a thin-plate spline model, whereby transformed ground-truth data are the external covariates of the model. The performance of the proposed method is measured with the root mean square error (RMSE), calculated as the root of the pixel-by-pixel mean square differences between the original data and the data predicted with the TpsWc model and with a state-space model with and without covariates. The study shows that the use of ground-truth data reduces the RMSE in both the TpsWc model and the state-space model used for comparison purposes. The new method successfully removes the abnormal data while preserving the phenology of the raw data. The RMSE reduction percentage varies according to the derived variables (NDVI or LST), but reductions of up to 20% are achieved with the new proposal.
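
The abstracts above evaluate predictions with the RMSE and with relative versions of it (RRMSE). A minimal Python sketch of both measures follows. The RMSE is computed as described in the last abstract, as the root of the mean of the pixel-by-pixel squared differences. The relative version shown here, RMSE divided by the mean of the original values, is only one common definition and is an assumption: the abstract mentions three RRMSE versions but does not define them. The function names and the sample values are hypothetical and do not come from the papers.

```python
import math


def rmse(original, predicted):
    """Root of the mean of the element-by-element squared differences."""
    if len(original) != len(predicted):
        raise ValueError("original and predicted must have the same length")
    if not original:
        raise ValueError("at least one value is required")
    squared = [(o - p) ** 2 for o, p in zip(original, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def relative_rmse(original, predicted):
    """RMSE divided by the mean of the original values.

    Assumption: this is one common way to define a relative RMSE,
    not necessarily any of the three versions used in the paper.
    """
    mean_original = sum(original) / len(original)
    if mean_original == 0:
        raise ValueError("mean of the original values is zero")
    return rmse(original, predicted) / mean_original


# Hypothetical pixel values, flattened into one list per image.
original_pixels = [0.2, 0.4, 0.6, 0.8]
predicted_pixels = [0.2, 0.5, 0.5, 0.8]

print(rmse(original_pixels, predicted_pixels))
print(relative_rmse(original_pixels, predicted_pixels))
```

Dividing by the mean makes errors comparable across variables with different scales, such as NDVI and LST, although it is unstable when the mean of the original values is close to zero.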