Calculation of Sensitivity and Specificity from Partial Data for Meta-Analyses: Introducing Some Practical Methods

Introduction: Meta-analyses of diagnostic/prognostic studies for calculating the pooled sensitivity and specificity require true positive (TP), true negative (TN), false positive (FP), and false negative (FN) counts. However, few studies report these values directly. This study aimed to consolidate...

Full description

Saved in:
Bibliographic Details
Main Authors: Reihanesadat Khatami, Mohammadsadegh Faghihi, Hannanesadat Khatami, Mahmoud Yousefifard, Seyedhesamoddin Khatami
Format: Article
Language:English
Published: Shahid Beheshti University of Medical Sciences 2025-06-01
Series:Archives of Academic Emergency Medicine
Subjects:
Online Access:https://journals.sbmu.ac.ir/aaem/index.php/AAEM/article/view/2678
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Introduction: Meta-analyses of diagnostic/prognostic studies for calculating the pooled sensitivity and specificity require true positive (TP), true negative (TN), false positive (FP), and false negative (FN) counts. However, few studies report these values directly. This study aimed to consolidate practical methods to reconstruct sensitivity and specificity from minimal data. Methods: Our framework addresses three main situations: (1) algebraic rearrangements to compute specificity given partial metrics; (2) digitization of receiver operating characteristic (ROC) curves to obtain threshold-specific sensitivity and specificity; and (3) application of the binormal model when only AUC and prevalence are available. We tested these methods on a dataset related to mortality prediction in myocardial infarction (MI) using machine learning models, assessing how well they reconstructed sensitivity and specificity. Results: Algebraic formulas and ROC digitization yielded reliable estimates when partial metrics or graphical curves were sufficiently detailed. However, the binormal model, which assumes equal variances, showed noticeable inaccuracies, especially for sensitivity. Linear regression analyses indicated that higher prevalence and higher AUC reduced estimation errors. Conclusion: These methods offer practical alternatives for reconstructing diagnostic accuracy measures when data are incomplete. Relying solely on AUC-based estimations may introduce substantial bias, particularly in low-prevalence contexts. We recommend that primary studies report threshold-specific sensitivity and specificity to support more accurate meta-analytic estimations.
ISSN:2645-4904