Predicting the Recurrence of Differentiated Thyroid Cancer Using Whale Optimization-Based XGBoost Algorithm
<b>Background/Objectives:</b> Differentiated Thyroid Cancer (DTC), comprising papillary and follicular carcinomas, is the most common type of thyroid cancer. This is highly infectious and increasing at a higher rate. Some patients experience recurrence even after undergoing successful tr...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-07-01
|
Series: | Diagnostics |
Subjects: | |
Online Access: | https://www.mdpi.com/2075-4418/15/13/1684 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | <b>Background/Objectives:</b> Differentiated Thyroid Cancer (DTC), comprising papillary and follicular carcinomas, is the most common type of thyroid cancer. This is highly infectious and increasing at a higher rate. Some patients experience recurrence even after undergoing successful treatment. Early signs of recurrence can be hard to identify, and the existing health care system cannot always identify it on time. Therefore, predicting its recurrence accurately and in its early stage is a significant clinical challenge. Numerous advanced technologies, such as machine learning, are being used to overcome this clinical challenge. Thus, this study presents a novel approach for predicting the recurrence of DTC. The key objective is to improve the prediction accuracy through hyperparameter optimization. <b>Methods:</b> In order to achieve this, we have used a metaheuristic algorithm, the whale optimization algorithm (WOA) and its modified version. The modifications that we introduced in the original WOA algorithm are a piecewise linear chaotic map for population initialization and inertia weight. Both of our algorithms optimize the hyperparameters of the Extreme Gradient Boosting (XGBoost) model to increase the overall performance. The proposed algorithms were applied to the dataset collected from the University of California, Irvine (UCI), Machine Learning Repository to predict the chances of recurrence for DTC. This dataset consists of 383 samples with a total of 16 features. Each feature captures the critical medical and demographic information. <b>Results:</b> The model has shown an accuracy of 99% when optimized with WOA and 97% accuracy when optimized with the modified WOA. <b>Conclusions:</b> Furthermore, we have compared our work with other innovative works and validated the performance of our model for the prediction of DTC recurrence. |
---|---|
ISSN: | 2075-4418 |