Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months

Aim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform,...

Full description

Saved in:
Bibliographic Details
Main Authors: A. E. Andreychenko, A. D. Ermak, D. V. Gavrilov, R. E. Novitsky, O. M. Drapkina, A. V. Gusev
Format: Article
Language:Russian
Published: «SILICEA-POLIGRAF» LLC 2025-03-01
Series:Кардиоваскулярная терапия и профилактика
Subjects:
Online Access:https://cardiovascular.elpub.ru/jour/article/view/4130
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839577694776328192
author A. E. Andreychenko
A. D. Ermak
D. V. Gavrilov
R. E. Novitsky
O. M. Drapkina
A. V. Gusev
author_facet A. E. Andreychenko
A. D. Ermak
D. V. Gavrilov
R. E. Novitsky
O. M. Drapkina
A. V. Gusev
author_sort A. E. Andreychenko
collection DOAJ
description Aim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform, 1165770 records of 151492 patients with HTN were selected. After the initial selection, a total of 43 anamnestic, constitutional, clinical, and paraclinical features were used as predictors. Automatic machine learning tools were used to create the models. A wide range of algorithms was considered, including logistic regression, decision tree-based methods using gradient boosting and bagging, discriminant analysis, a neural network algorithm and a naive Bayes classifier. Data from a single region were used for external validation.Results. The XGBoost model showed the best results, achieving an area under the ROC curve (AUC) of 0,849 (95% confidence interval: 0,825-0,873) during internal testing and 0,815 (95% confidence interval: 0,797-0,835) during external validation.Conclusion. A new highly accurate model for predicting hospitaliza­tion of HTN patients based on real-world data was developed. The results of external validation of the final model showed relative re­sistance to new data from another region that in combination with quality metrics presents the possibility of its approval for application in clinical practice.
format Article
id doaj-art-152c4d5c4fbe45d69a52fef4edfb9c4c
institution Matheson Library
issn 1728-8800
2619-0125
language Russian
publishDate 2025-03-01
publisher «SILICEA-POLIGRAF» LLC
record_format Article
series Кардиоваскулярная терапия и профилактика
spelling doaj-art-152c4d5c4fbe45d69a52fef4edfb9c4c2025-08-04T12:50:32Zrus«SILICEA-POLIGRAF» LLCКардиоваскулярная терапия и профилактика1728-88002619-01252025-03-0124110.15829/1728-8800-2025-41303099Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 monthsA. E. Andreychenko0A. D. Ermak1D. V. Gavrilov2R. E. Novitsky3O. M. Drapkina4A. V. Gusev5OOO K-SkyOOO K-SkyOOO K-SkyOOO K-SkyNational Medical Research Center for Therapy and Preventive MedicineCentral Research Institute for Health Organization and Informatics; Research and Practical Clinical Center for Diagnostics and Telemedicine TechnologiesAim. To develop models for predicting hospitalizations of hypertensive (HTN) over 12 months using machine learning algorithms and to validate them using real-world practice data.Material and methods. Based on the data from depersonalized electronic health records obtained from the Webiomed platform, 1165770 records of 151492 patients with HTN were selected. After the initial selection, a total of 43 anamnestic, constitutional, clinical, and paraclinical features were used as predictors. Automatic machine learning tools were used to create the models. A wide range of algorithms was considered, including logistic regression, decision tree-based methods using gradient boosting and bagging, discriminant analysis, a neural network algorithm and a naive Bayes classifier. Data from a single region were used for external validation.Results. The XGBoost model showed the best results, achieving an area under the ROC curve (AUC) of 0,849 (95% confidence interval: 0,825-0,873) during internal testing and 0,815 (95% confidence interval: 0,797-0,835) during external validation.Conclusion. A new highly accurate model for predicting hospitaliza­tion of HTN patients based on real-world data was developed. The results of external validation of the final model showed relative re­sistance to new data from another region that in combination with quality metrics presents the possibility of its approval for application in clinical practice.https://cardiovascular.elpub.ru/jour/article/view/4130hypertensionhospitalizationpredictive modelsmachine learning
spellingShingle A. E. Andreychenko
A. D. Ermak
D. V. Gavrilov
R. E. Novitsky
O. M. Drapkina
A. V. Gusev
Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
Кардиоваскулярная терапия и профилактика
hypertension
hospitalization
predictive models
machine learning
title Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
title_full Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
title_fullStr Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
title_full_unstemmed Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
title_short Development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
title_sort development and validation of machine learning models predicting hospitalizations of hypertensive patients over 12 months
topic hypertension
hospitalization
predictive models
machine learning
url https://cardiovascular.elpub.ru/jour/article/view/4130
work_keys_str_mv AT aeandreychenko developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months
AT adermak developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months
AT dvgavrilov developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months
AT renovitsky developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months
AT omdrapkina developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months
AT avgusev developmentandvalidationofmachinelearningmodelspredictinghospitalizationsofhypertensivepatientsover12months