학술논문

A Prototype Application to Identify LGBT Patients in Clinical Notes
Document Type
Conference
Source
2020 IEEE International Conference on Big Data (Big Data) Big Data (Big Data), 2020 IEEE International Conference on. :4270-4275 Dec, 2020
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Engineering Profession
Geoscience
Signal Processing and Analysis
Sensitivity
Prototypes
Medical services
Machine learning
Big Data
Electronic medical records
Task analysis
LGBT
sexual orientation
gender identity
natural language processing
machine learning
Language
Abstract
LGBT Patients bear a disproportional burden of health disparities. Sexual orientation and gender identity is of clinical relevance to healthcare providers and data scientists. However, little work has been done to identify LGBT patients, especially in data derived from electronic health record notes. We developed a prototype application that leverages machine learning and rule-based pattern matching methods to identify LGBT patients in a large data source, Veterans Health Administration electronic health record notes. This application achieved 88.2% sensitivity, 91.5% specificity, and 85.9% positive predictive value in a binary classification task for three random document test sets. This work has implications in both improved healthcare and data research.