학술논문

Analyzing the Performance of Feature Selection on Regression Problems: A Case Study on Older Adults’ Functional Profile
Document Type
Periodical
Source
IEEE Transactions on Emerging Topics in Computing IEEE Trans. Emerg. Topics Comput. Emerging Topics in Computing, IEEE Transactions on. 11(1):137-152 Jan, 2023
Subject
Computing and Processing
Feature extraction
Predictive models
Medical services
Computational modeling
Older adults
Input variables
Data models
Feature selection
regression
machine learning
aging informatics
healthcare data analytics
ehealth
Language
ISSN
2168-6750
2376-4562
Abstract
Healthcare systems are capable of collecting a significant number of patient health-related parameters. Analyzing them to find the reasons that cause a given disease is challenging. Feature Selection techniques have been used to address this issue—reducing these parameters to a smaller set with the most ”determinant” information. However, existing proposals usually focus on classification problems—aimed to detect whether a person is or is not suffering from an illness or from a finite set of illnesses. However, there are many situations in which health professionals need a numerical assessment to quantify the severity of an illness, thus dealing with a regression problem instead. Proposals using Feature Selection here are very limited. This paper examines several Feature Selection techniques to gauge their applicability to the regression-type problems, comparing these techniques by applying them to a real-life scenario on the functional profiles of older adults. Data from 829 functional profiles assessments in 49 residential homes were used in this study. The number of features was reduced from 31 to 25—with a correlation between inputs and outputs of 0.99 according to the $R^{2}$R2 score and a Mean Square Error (MSE) of 0.11—or to 14 features—with a correlation of 0.98 and MSE of 5.73.