Development and Evaluation of a Computable Phenotype for Normal Tension Glaucoma
Purpose: To develop a computable phenotype for normal tension glaucoma (NTG) to enhance disease identification from electronic health records (EHRs). Design: Retrospective cohort study. Subjects: Deidentified EHR data from an academic medical center identified 1851 patients aged ≥40 years, with glau...
Saved in:
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-11-01
|
Series: | Ophthalmology Science |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666914525001563 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Purpose: To develop a computable phenotype for normal tension glaucoma (NTG) to enhance disease identification from electronic health records (EHRs). Design: Retrospective cohort study. Subjects: Deidentified EHR data from an academic medical center identified 1851 patients aged ≥40 years, with glaucoma and available clinical notes. Methods: Of these 1851 patients, 200 were randomly selected for a chart review to receive gold standard diagnoses. Four rule-based NTG computable phenotypes were developed and tested. Phenotype 1 relied on NTG International Classification of Diseases (ICD)-9 and ICD-10 codes. Phenotype 2 incorporated structured intraocular pressure (IOP) data and medication lists. Phenotype 3 used only structured IOP data. Phenotype 4 combined structured IOP and medication data natural language processing (NLP) to extract IOP values and NTG mentions from chart notes. Internal and external validation were performed. Main Outcome Measures: F1 score, sensitivities, specificities, positive predictive value (PPV), negative predictive value (NPV), and accuracy. Results: Chart review identified NTG in 30% of patients, and only 7% had NTG ICD codes. Phenotype 1 had an F1 of 36.8%, sensitivity 24.1%, specificity 97%, PPV 77.8%, NPV 74.9%, and accuracy 75.1%. Compared with ICD codes, phenotypes 2 and 3 had F1 of 66.7% and 69.8%, sensitivity 77.6% and 89.7%, specificity 76.3% and 71.1%, PPV 58.4% and 57.1%, NPV 88.8% and 94.1%, and accuracy of 76.7% and 76.7%, respectively. Incorporating NLP, phenotype 4 had the best performance with an F1 of 77.4%, sensitivity 82.8%, specificity 86.7%, PPV 72.7%, NPV 92.1%, and accuracy 85.5%. Phenotypes 2 to 4 increase NTG case detection fourfold compared with phenotype 1. Conclusions: Normal tension glaucoma phenotypes using NLP achieved the best overall performance, and those incorporating structured data perform better than ICD codes alone. The NTG ICD code-based phenotype is highly specific but lacks sensitivity. Insights from this study may inform the development of computable phenotypes for other disease subtypes within broader disease categories. Financial Disclosure(s): Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article. |
---|---|
ISSN: | 2666-9145 |