AI approaches for phenotyping Alzheimer's disease and related dementias using electronic health records

Abstract INTRODUCTION The current standard electronic (e‐)phenotype for identifying patients with Alzheimer's disease and related dementias (ADRD) from medical claims data yields suboptimal diagnostic accuracy. This study leveraged artificial intelligence (AI)–based text‐classification methods...

Full description

Saved in:
Bibliographic Details
Main Authors: Sara Knox, Stephanie Aghamoosa, Paul M. Heider, Maxwell Cutty, Andrew Wright, Dmitry Scherbakov, Gabriel Hood, Sara A. Nolin, Jihad S. Obeid
Format: Article
Language:English
Published: Wiley 2025-04-01
Series:Alzheimer’s & Dementia: Translational Research & Clinical Interventions
Subjects:
Online Access:https://doi.org/10.1002/trc2.70089
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract INTRODUCTION The current standard electronic (e‐)phenotype for identifying patients with Alzheimer's disease and related dementias (ADRD) from medical claims data yields suboptimal diagnostic accuracy. This study leveraged artificial intelligence (AI)–based text‐classification methods to improve the identification of patients with dementia due to ADRD using clinical notes from electronic health records (EHRs). METHODS EHR data for patients aged ≥ 64 (N = 4000) from an academic medical center were used. The cohort included 1000 patients with ADRD per the Chronic Conditions Warehouse (CCW) algorithm for ADRD (i.e., at least one ADRD International Classification of Diseases, Tenth Revision codes [ICD‐10 code]) and 3000 matched controls without ADRD (i.e., no CCW codes). We trained several AI‐based text‐classification models, including bag‐of‐words models, deep learning, and large language models (LLMs), to make ADRD determinations from clinical notes. The performance of each model was evaluated against “gold standard” manual chart review. RESULTS A foundational LLM derived from Llama 2 demonstrated superior performance in identifying patients with ADRD (area under the curve [AUC] = 0.9534, F1 score 0.8571) compared to both the current standard CCW algorithm (AUC = 0.8482, F1 score 0.8323, although only the AUC was statistically significantly different) and other AI‐based models. Several of the AI‐based models, including convolutional neural networks, also outperformed the CCW algorithm. DISCUSSION These findings highlight the potential of AI‐based text‐classification methods to optimize the automated identification of patients with ADRD using rich EHR data. However, the success of this approach depends on the quality of clinical notes, and more work is needed to refine and validate these methods across more diverse data sets. Highlights The current e‐phenotype for patients with Alzheimer's disease and related dementias (ADRD) in electronic health records has suboptimal diagnostic accuracy. The study used artificial intelligence (AI)–based text classification methods to improve the detection of patients with ADRD. AI‐based models, including convolutional neural networks, outperformed the Chronic Conditions Warehouse algorithm.
ISSN:2352-8737