TY - JOUR
T1 - Adulterant estimation in paprika powder using deep learning and chemometrics through near-infrared spectroscopy
AU - Castro, Wilson
AU - Oblitas, Jimy
AU - Nuñez, Luis
AU - Yoplac, Ives
AU - Avila-George, Himer
AU - De-la-Torre, Miguel
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
PY - 2024
Y1 - 2024
N2 - Spices and other food products have been permanently susceptible to adulteration, affecting safety and acceptability when commercialized. A relevant alternative to detect contaminants in food products is to couple near-infrared spectroscopy (NIR) with chemometrics. Among the most accurate chemometric techniques employed to analyze food products, partial least squares regression (PLSR) combines features from and generalizes principal component analysis (PCA) to create compact and accurate models. Other techniques inspired in the human brain, such as multilayer perceptron, the long short-term memory (LSTM) models, and other approaches based on deep learning, take advantage of the high complexity of weights and neurons to train models based on large amounts of data. In this paper, a methodology is proposed to evaluate chemometric tools to estimate the percentage of adulterants in paprika powder using NIR spectroscopy, and three approaches are proposed and compared showing different performances. According to the methodology, the paprika samples were dried and separated into pericarp, peduncle, and seed cake. The resulting elements were finely milled, sieved, and mixed into 21 different combinations with a different percentage of each. Spectral profiles were used to train PLSR, multilayer perceptron, and regression models based on LSTM networks. The models were compared following a k-fold cross-validation strategy. Results showed that PLSR presented the highest R2=0.978 for peduncle adulterant estimation, and the lowest RMSE=6.24. In particular, when seed cake powder was used as an adulterant, the PLSR approach showed the highest R2=0.981, and the lowest RMSE=5.806. The RPD values were higher than 2.000 for all models that use the peduncle as an adulterant and only for models bound to the PLSR in the adulterated samples with pressed seed cake. In summary, the best predictions were obtained using PLSR models, providing evidence of the feasibility of using NIR spectra to estimate the percentage of adulterants in paprika powder.
AB - Spices and other food products have been permanently susceptible to adulteration, affecting safety and acceptability when commercialized. A relevant alternative to detect contaminants in food products is to couple near-infrared spectroscopy (NIR) with chemometrics. Among the most accurate chemometric techniques employed to analyze food products, partial least squares regression (PLSR) combines features from and generalizes principal component analysis (PCA) to create compact and accurate models. Other techniques inspired in the human brain, such as multilayer perceptron, the long short-term memory (LSTM) models, and other approaches based on deep learning, take advantage of the high complexity of weights and neurons to train models based on large amounts of data. In this paper, a methodology is proposed to evaluate chemometric tools to estimate the percentage of adulterants in paprika powder using NIR spectroscopy, and three approaches are proposed and compared showing different performances. According to the methodology, the paprika samples were dried and separated into pericarp, peduncle, and seed cake. The resulting elements were finely milled, sieved, and mixed into 21 different combinations with a different percentage of each. Spectral profiles were used to train PLSR, multilayer perceptron, and regression models based on LSTM networks. The models were compared following a k-fold cross-validation strategy. Results showed that PLSR presented the highest R2=0.978 for peduncle adulterant estimation, and the lowest RMSE=6.24. In particular, when seed cake powder was used as an adulterant, the PLSR approach showed the highest R2=0.981, and the lowest RMSE=5.806. The RPD values were higher than 2.000 for all models that use the peduncle as an adulterant and only for models bound to the PLSR in the adulterated samples with pressed seed cake. In summary, the best predictions were obtained using PLSR models, providing evidence of the feasibility of using NIR spectra to estimate the percentage of adulterants in paprika powder.
KW - LSTM
KW - NIR
KW - Paprika pepper
KW - Perceptron
KW - PLSR
UR - http://www.scopus.com/inward/record.url?scp=85192216684&partnerID=8YFLogxK
U2 - 10.1007/s00521-024-09830-8
DO - 10.1007/s00521-024-09830-8
M3 - Article
AN - SCOPUS:85192216684
SN - 0941-0643
JO - Neural Computing and Applications
JF - Neural Computing and Applications
ER -