Academic Paper

Binary classification of users of electronic cigarettes and smokeless tobacco through biomarkers to assess similarity with current and former smokers: machine learning applied to the population assessment of tobacco and health study
Document Type
Original Paper
Source
BMC Public Health. 23(1)
Subject
Population study
Smoking
E-cigarette
Oral tobacco
Supervised learning
Biomarker of exposure
Biomarker of potential harm
Language
English
ISSN
1471-2458
Abstract
Background: Exposure to harmful and potentially harmful constituents in cigarette smoke is a risk factor for cardiovascular and respiratory diseases. Tobacco products that could reduce exposure to these constituents have been developed. However, the long-term effects of their use on health remain unclear. The Population Assessment of Tobacco and Health (PATH) study is a population-based study examining the health effects of smoking and cigarette smoking habits in the U.S. population. Participants include users of tobacco products, including electronic cigarettes and smokeless tobacco. In this study, we attempted to evaluate the population-wide effects of these products, using machine learning techniques and data from the PATH study.
Methods: Biomarkers of exposure (BoE) and potential harm (BoPH) in cigarette smokers and former smokers in wave 1 of PATH were used to create binary classification machine-learning models that classified participants as either current (BoE: N = 102, BoPH: N = 428) or former smokers (BoE: N = 102, BoPH: N = 428). Data on the BoE and BoPH of users of electronic cigarettes (BoE: N = 210, BoPH: N = 258) and smokeless tobacco (BoE: N = 206, BoPH: N = 242) were input into the models to investigate whether these product users were classified as current or former smokers. The disease status of individuals classified as either current or former smokers was investigated.
Results: The classification models for BoE and BoPH both had high model accuracy. More than 60% of participants who used either one of electronic cigarettes or smokeless tobacco were classified as former smokers in the classification model for BoE. Fewer than 15% of current smokers and dual users were classified as former smokers. A similar trend was found in the classification model for BoPH. Compared with those classified as former smokers, a higher percentage of those classified as current smokers had cardiovascular disease (9.9–10.9% vs. 6.3–6.4%) and respiratory diseases (19.4–22.2% vs. 14.2–16.7%).
Conclusions: Users of electronic cigarettes or smokeless tobacco are likely to be similar to former smokers in their biomarkers of exposure and potential harm. This suggests that using these products helps to reduce exposure to the harmful constituents of cigarettes, and they are potentially less harmful than conventional cigarettes.
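The workflow in the Methods (train a binary model on current versus former smokers, then pass other product users through it and count how many land in each class) can be illustrated with a minimal sketch. The abstract does not name the algorithm used, so a nearest-centroid classifier stands in here purely for illustration. The function names and the two-feature toy values are hypothetical and are not PATH data.

```python
from statistics import mean

def fit_centroids(rows, labels):
    """Compute the mean feature vector of each class (nearest-centroid stand-in)."""
    centroids = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        centroids[cls] = [mean(col) for col in zip(*members)]
    return centroids

def predict(centroids, row):
    """Assign a row to the class whose centroid is closest (squared Euclidean)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda cls: sq_dist(centroids[cls], row))

def share_classified_as(centroids, rows, cls):
    """Fraction of rows that the model assigns to the given class."""
    preds = [predict(centroids, r) for r in rows]
    return preds.count(cls) / len(preds)

# Toy, made-up biomarker values (two features per person); NOT PATH data.
current = [[5.0, 4.0], [6.0, 5.0], [5.5, 4.5]]
former = [[1.0, 1.0], [0.5, 1.5], [1.5, 0.5]]

# Train on a balanced set of current and former smokers.
centroids = fit_centroids(
    current + former,
    ["current"] * len(current) + ["former"] * len(former),
)

# Hypothetical users of another product, scored by the trained model.
other_users = [[1.2, 0.8], [0.9, 1.1], [5.0, 4.2], [2.0, 1.5]]
share_former = share_classified_as(centroids, other_users, "former")
print(f"Classified as former smokers: {share_former:.0%}")
```

With real biomarker panels, features would typically need scaling and the model would need held-out evaluation before its classifications of other product users could be interpreted.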