학술논문

Prediction of Pakistani Honey Authenticity Through Machine Learning
Document Type
Periodical
Source
IEEE Access Access, IEEE. 10:87508-87521 2022
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Machine learning algorithms
Machine learning
Data analysis
Botany
Chemicals
Prediction algorithms
Nuclear magnetic resonance
botanical origin
geographical origin
physicochemical properties
Pakistani honey
Language
ISSN
2169-3536
Abstract
Honey is a high-demand product in many countries because it is high in nutritional value and rich in antioxidants. Thus, the demand for honey is increased. However, the productivity of honey is naturally lower than its demand. Therefore, honey has often become a target for adulteration. Adulteration of honey is a critical issue because the nutritional value of pure honey is reduced by adding cheap and easily available sweeteners, affecting the consumers’ health. Thus, investigating honey authenticity is popular among regulatory bodies, the food industry, retail sellers, and consumers. Several works have been done to predict the authenticity of honey using various physicochemical features. Few other works have also classified honey on the basis of geographical or botanical origin. However, previous studies have three major limitations. First, the existing studies used the imbalanced datasets, and the performance of these studies further needs attention. Second, as far as we know, no researcher has attempted to use machine learning approaches in investigating the adulteration of Pakistani honey. Finally, the dataset for predicting the authenticity of Pakistani honey is lacking. Therefore, this study proposes a novel classification model to address the aforementioned weaknesses by classifying the authenticity of Pakistani honey using machine learning algorithms and several physicochemical features. This work also presents three classification models systematically to classify the Pakistani honey into three levels. The first level classifies whether the honey is original or branded. The second level classifies the geographical origin. The botanical origin of honey is classified in the third level. Our experimental results show that the proposed features coupled with machine learning algorithms can predict the authenticity of Pakistani honey with outstanding results. We believe that our proposed work will be proved beneficial in reducing the adulteration of Pakistani honey.