학술논문

Evaluating and Improving the Performance and Racial Fairness of Algorithms for GFR Estimation
Document Type
Conference
Source
2024 IEEE First International Conference on Artificial Intelligence for Medicine, Health and Care (AIMHC) AIMHC Artificial Intelligence for Medicine, Health and Care (AIMHC), 2024 IEEE First International Conference on. :251-257 Feb, 2024
Subject
Computing and Processing
Support vector machines
Estimation error
Machine learning algorithms
Filtration
Computational modeling
Neural networks
Predictive models
algorithmic fairness
glomerular filtration rate
predictive modeling
machine learning
electronic health record
Language
Abstract
Data-driven clinical prediction algorithms are used widely by clinicians. Understanding what factors can impact the performance and fairness of data-driven algorithms is an important step towards achieving equitable healthcare. To investigate the impact of modeling choices on the algorithmic performance and fairness, we make use of a case study to build a prediction algorithm for estimating glomerular filtration rate (GFR) based on the patient's electronic health record (EHR). We compare three distinct approaches for estimating GFR: CKD-EPI equations, epidemiological models, and EHR-based models. For epidemiological models and EHR-based models, four machine learning models of varying computational complexity (i.e., linear regression, support vector machine, random forest regression, and neural network) were compared. Performance metrics included root mean squared error (RMSE), median difference, and the proportion of GFR estimates within 30% of the measured GFR value (P30). Differential performance between non-African American and African American group was used to assess algorithmic fairness with respect to race. Our study showed that the variable race had a negligible effect on error, accuracy, and differential performance. Furthermore, including more relevant clinical features (e.g., common comorbidities of chronic kidney disease) and using more complex machine learning models, namely random forest regression, significantly lowered the estimation error of GFR. However, the difference in performance between African American and non-African American patients did not decrease, where the estimation error for African American patients remained consistently higher than non-African American patients, indicating that more objective patient characteristics should be discovered and included to improve algorithm performance.