Data augmentation techniques for machine learning applied to optical spectroscopy datasets in agrifood applications: a comprehensive review

dc.contributor.authorGracia Moisés, Ander
dc.contributor.authorVitoria Pascual, Ignacio
dc.contributor.authorImas González, José Javier
dc.contributor.authorRuiz Zamarreño, Carlos
dc.contributor.departmentIngeniería Eléctrica, Electrónica y de Comunicaciónes_ES
dc.contributor.departmentIngeniaritza Elektrikoa, Elektronikoa eta Telekomunikazio Ingeniaritzaeu
dc.contributor.departmentInstitute of Smart Cities - ISCen
dc.date.accessioned2024-05-21T17:02:55Z
dc.date.available2024-05-21T17:02:55Z
dc.date.issued2023
dc.date.updated2024-05-21T16:28:31Z
dc.description.abstractMachine learning (ML) and deep learning (DL) have achieved great success in different tasks. These include computer vision, image segmentation, natural language processing, predicting classification, evaluating time series, and predicting values based on a series of variables. As artificial intelligence progresses, new techniques are being applied to areas like optical spectroscopy and its uses in specific fields, such as the agrifood industry. The performance of ML and DL techniques generally improves with the amount of data available. However, it is not always possible to obtain all the necessary data for creating a robust dataset. In the particular case of agrifood applications, dataset collection is generally constrained to specific periods. Weather conditions can also reduce the possibility to cover the entire range of classifications with the consequent generation of imbalanced datasets. To address this issue, data augmentation (DA) techniques are employed to expand the dataset by adding slightly modified copies of existing data. This leads to a dataset that includes values from laboratory tests, as well as a collection of synthetic data based on the real data. This review work will present the application of DA techniques to optical spectroscopy datasets obtained from real agrifood industry applications. The reviewed methods will describe the use of simple DA techniques, such as duplicating samples with slight changes, as well as the utilization of more complex algorithms based on deep learning generative adversarial networks (GANs), and semi-supervised generative adversarial networks (SGANs).en
dc.description.sponsorshipThis work was supported in part by the Industrial Doctorate grants 2021 from Government and by the PID2022-1374370B-100 research grant from the Ministry of Science and Innovationen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationGracia Moisés, A., Vitoria Pascual, I., Imas González, J. J., Zamarreño, C. R. (2023) Data augmentation techniques for machine learning applied to optical spectroscopy datasets in agrifood applications: A comprehensive review. Sensors, 23(20), 1-29. https://doi.org/10.3390/s23208562.en
dc.identifier.doi10.3390/s23208562
dc.identifier.issn1424-8220
dc.identifier.urihttps://academica-e.unavarra.es/handle/2454/48147
dc.language.isoengen
dc.publisherMDPIen
dc.relation.ispartofSensors 2023, 23(20), 8562en
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI//PID2022-1374370B-100/
dc.relation.publisherversionhttps://doi.org/10.3390/s23208562
dc.rights© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.en
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectOptical spectroscopyen
dc.subjectAgrifood industryen
dc.subjectArtificial intelligenceen
dc.subjectData augmentation (DA)en
dc.subjectGenerative adversarial networks (GANs)en
dc.titleData augmentation techniques for machine learning applied to optical spectroscopy datasets in agrifood applications: a comprehensive reviewen
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion
dspace.entity.typePublication
relation.isAuthorOfPublication5131262a-e53d-4f80-83e3-f4f58d720031
relation.isAuthorOfPublication1d3ab98f-d163-4f9d-8baf-a6f5130c957c
relation.isAuthorOfPublication77495fb3-fd9a-4636-be2d-37165dd2e90d
relation.isAuthorOfPublicationf85c4fed-8804-4e02-b746-0855066291e3
relation.isAuthorOfPublication.latestForDiscovery1d3ab98f-d163-4f9d-8baf-a6f5130c957c

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Gracia_DataAugmentation.pdf
Size:
5.21 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.78 KB
Format:
Item-specific license agreed to upon submission
Description: