A framework for predicting zoonotic hosts using pseudo-absences: the case of Echinococcus multilocularis

Identifying the host range of zoonotic parasites is challenging due to limited data and sampling biases. In particular, while more information exists for susceptible hosts, data on resistant species is extremely scant. Echinococcus multilocularis (Leuckart, 1863) (Cestoda: Taeniidae) is the causativ...

Full description

Saved in:
Bibliographic Details
Main Authors: Andrea Simoncini, Dimitri Giunchi, Marta Marcucci, Alessandro Massolo
Format: Article
Language:English
Published: Elsevier 2025-12-01
Series:Ecological Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1574954125003048
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Identifying the host range of zoonotic parasites is challenging due to limited data and sampling biases. In particular, while more information exists for susceptible hosts, data on resistant species is extremely scant. Echinococcus multilocularis (Leuckart, 1863) (Cestoda: Taeniidae) is the causative agent of alveolar echinococcosis, one of the most significant food-borne zoonoses worldwide. Using data on susceptibility and competence of Holarctic cricetid and murid rodents, key intermediate hosts for E. multilocularis, we developed models to predict the likelihood of infection for any rodent species in the Holarctic. These models incorporated morphological and ecological characteristics and employed two approaches: Generalized Linear Models (GLM) and Presence-Unlabeled Learning (PU-L), a machine learning technique. To train the models, we defined pseudo-absences based on the bias in research effort. We compared the two algorithms and selected GLM as the most effective, using it to map potentially susceptible rodent species across phylogeny and geographic space. Predictions identified several potentially unreported hosts, suggesting that the current understanding of E. multilocularis host distribution may underestimate the true risk. The predicted richness of intermediate hosts peaked in Central-Eastern Europe, Western North America and Central Asia, while the ratio of predicted hosts to total rodent richness was highest in the northern latitudes and the Tibetan Plateau. The average temperature in the geographic range and range size emerged as the strongest predictors of host susceptibility. The workflow demonstrates promise for application to other host-parasite systems with unknown host ranges.
ISSN:1574-9541