Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
Aim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform,...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | Russian |
Published: |
«SILICEA-POLIGRAF» LLC
2025-03-01
|
Series: | Кардиоваскулярная терапия и профилактика |
Subjects: | |
Online Access: | https://cardiovascular.elpub.ru/jour/article/view/4130 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1839577694776328192 |
---|---|
author | A. E. Andreychenko A. D. Ermak D. V. Gavrilov R. E. Novitsky O. M. Drapkina A. V. Gusev |
author_facet | A. E. Andreychenko A. D. Ermak D. V. Gavrilov R. E. Novitsky O. M. Drapkina A. V. Gusev |
author_sort | A. E. Andreychenko |
collection | DOAJ |
description | Aim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform, 1165770 records of 151492 patients with HTN were selected. After the initial selection, a total of 43 anamnestic, constitutional, clinical, and paraclinical features were used as predictors. Automatic machine learning tools were used to create the models. A wide range of algorithms was considered, including logistic regression, decision tree-based methods using gradient boosting and bagging, discriminant analysis, a neural network algorithm and a naive Bayes classifier. Data from a single region were used for external validation.Results. The XGBoost model showed the best results, achieving an area under the ROC curve (AUC) of 0,849 (95% confidence interval: 0,825-0,873) during internal testing and 0,815 (95% confidence interval: 0,797-0,835) during external validation.Conclusion. A new highly accurate model for predicting hospitalization of HTN patients based on real-world data was developed. The results of external validation of the final model showed relative resistance to new data from another region that in combination with quality metrics presents the possibility of its approval for application in clinical practice. |
format | Article |
id | doaj-art-152c4d5c4fbe45d69a52fef4edfb9c4c |
institution | Matheson Library |
issn | 1728-8800 2619-0125 |
language | Russian |
publishDate | 2025-03-01 |
publisher | «SILICEA-POLIGRAF» LLC |
record_format | Article |
series | Кардиоваскулярная терапия и профилактика |
spelling | doaj-art-152c4d5c4fbe45d69a52fef4edfb9c4c2025-08-04T12:50:32Zrus«SILICEA-POLIGRAF» LLCКардиоваскулярная терапия и профилактика1728-88002619-01252025-03-0124110.15829/1728-8800-2025-41303099Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 monthsA. E. Andreychenko0A. D. Ermak1D. V. Gavrilov2R. E. Novitsky3O. M. Drapkina4A. V. Gusev5OOO K-SkyOOO K-SkyOOO K-SkyOOO K-SkyNational Medical Research Center for Therapy and Preventive MedicineCentral Research Institute for Health Organization and Informatics; Research and Practical Clinical Center for Diagnostics and Telemedicine TechnologiesAim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform, 1165770 records of 151492 patients with HTN were selected. After the initial selection, a total of 43 anamnestic, constitutional, clinical, and paraclinical features were used as predictors. Automatic machine learning tools were used to create the models. A wide range of algorithms was considered, including logistic regression, decision tree-based methods using gradient boosting and bagging, discriminant analysis, a neural network algorithm and a naive Bayes classifier. Data from a single region were used for external validation.Results. The XGBoost model showed the best results, achieving an area under the ROC curve (AUC) of 0,849 (95% confidence interval: 0,825-0,873) during internal testing and 0,815 (95% confidence interval: 0,797-0,835) during external validation.Conclusion. A new highly accurate model for predicting hospitalization of HTN patients based on real-world data was developed. The results of external validation of the final model showed relative resistance to new data from another region that in combination with quality metrics presents the possibility of its approval for application in clinical practice.https://cardiovascular.elpub.ru/jour/article/view/4130hypertensionhospitalizationpredictive modelsmachine learning |
spellingShingle | A. E. Andreychenko A. D. Ermak D. V. Gavrilov R. E. Novitsky O. M. Drapkina A. V. Gusev Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months Кардиоваскулярная терапия и профилактика hypertension hospitalization predictive models machine learning |
title | Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
title_full | Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
title_fullStr | Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
title_full_unstemmed | Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
title_short | Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
title_sort | development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months |
topic | hypertension hospitalization predictive models machine learning |
url | https://cardiovascular.elpub.ru/jour/article/view/4130 |
work_keys_str_mv | AT aeandreychenko developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months AT adermak developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months AT dvgavrilov developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months AT renovitsky developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months AT omdrapkina developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months AT avgusev developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months |