Gracia Moisés, AnderVitoria Pascual, IgnacioAvedillo de la Casa, AmaiaMoreno Pérez, MaríaImas González, José JavierRuiz Zamarreño, Carlos2024-11-202025-01-01Gracia Moisés, A., Vitoria-Pascual, I., Avedillo de la Casa, A., Moreno-Pérez, M., Imas González, J. J., Ruiz-Zamarreño, C. (2025). Optimization of optical spectroscopy classification algorithms for limited data scenarios in the food industry: Tomato sauce samples case. Food Control, 167, 1-9. https://doi.org/10.1016/j.foodcont.2024.110819.0956-713510.1016/j.foodcont.2024.110819https://academica-e.unavarra.es/handle/2454/52546This study addresses the problem of training deep learning models with limited datasets, a significant challenge in sectors like medical imaging and food quality analysis. To tackle this issue, generative adversarial networks (GANs) will be employed to augment the available data and improve model performance. An innovative approach is introduced here, integrating semi-supervised learning and generative modeling to maximize the use of small datasets in developing robust models. The method involves reversing the conventional distribution of training and testing data to focus on model evaluation and generalization from limited samples. Wasserstein GANs (WGANs) and Semi-Supervised GANs (SGANs), are utilized to supplement datasets with synthetic but realistic examples, enhancing the training process in scenarios of data scarcity. These techniques are applied in the context of visible reflectance spectroscopy to analyze tomato sauces, demonstrating the method's effectiveness in non-invasively assessing key quality parameters such as oil content, °Brix, and pH. The results show significant improvements in model performance metrics: for %Oil content, overall accuracy increased from 0.47 to 0.66; for °Bx, it rose from 0.65 to 0.71; and for pH measurement, accuracy improved from 0.43 to 0.62. These outcomes highlight the model's improved capability to generalize and maintain accuracy with limited data.application/pdfeng© 2024 Elsevier Ltd. This manuscript version is made available under the CC-BY-NC-ND 4.0.Food quality controlGenerative adversarial networksOil contentpHSemi-supervised learningVisible reflectance spectroscopy°BrixOptimization of optical spectroscopy classification algorithms for limited data scenarios in the food industry: tomato sauce samples caseinfo:eu-repo/semantics/article2024-11-20info:eu-repo/semantics/embargoedAccess