Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications

Current sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhijie Ning, Yongbo Tie
Format: Article
Language:English
Published: MDPI AG 2025-07-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/14/7646
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839616611619700736
author Zhijie Ning
Yongbo Tie
author_facet Zhijie Ning
Yongbo Tie
author_sort Zhijie Ning
collection DOAJ
description Current sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative samples on the accuracy of landslide susceptibility evaluation models. To overcome this challenge, this study proposes a fuzzy membership-based sampling method for assessing negative sample credibility in the Liangshan Yi Autonomous Prefecture, where credibility is defined as the confidence level of stable nonlandslide samples. Subsequently, negative samples were sampled across stratified credibility thresholds to construct a frequency ratio–random forest coupled model. The influence of negative sample credibility on model performance was then systematically evaluated using various metrics, including the F1-score (metrics for evaluating classification performance), area under the receiver operating characteristic curve (AUC), and actual landslide distribution ratio (landslide proportion) in high-susceptibility zones. The results are as follows: (1) Increasing the credibility threshold progressively improves model precision while inducing systematic overestimation bias in regional susceptibility assessment; (2) Integrated analysis of model performance and landslide distribution characteristics (where recall, F1-score, and AUC values initially increase then decrease) confirms the optimal effectiveness when selecting negative samples within a credibility threshold range of 0.7–1.0. This study innovatively achieves quantitative optimization of negative samples and provides a universal solution for improving the performance of diverse models reliant on negative sampling strategies.
format Article
id doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c0
institution Matheson Library
issn 2076-3417
language English
publishDate 2025-07-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c02025-07-25T13:11:55ZengMDPI AGApplied Sciences2076-34172025-07-011514764610.3390/app15147646Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its ApplicationsZhijie Ning0Yongbo Tie1Chinese Academy of Geological Sciences, Beijing 100037, ChinaChengdu Center of China Geological Survey, Chengdu 610081, ChinaCurrent sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative samples on the accuracy of landslide susceptibility evaluation models. To overcome this challenge, this study proposes a fuzzy membership-based sampling method for assessing negative sample credibility in the Liangshan Yi Autonomous Prefecture, where credibility is defined as the confidence level of stable nonlandslide samples. Subsequently, negative samples were sampled across stratified credibility thresholds to construct a frequency ratio–random forest coupled model. The influence of negative sample credibility on model performance was then systematically evaluated using various metrics, including the F1-score (metrics for evaluating classification performance), area under the receiver operating characteristic curve (AUC), and actual landslide distribution ratio (landslide proportion) in high-susceptibility zones. The results are as follows: (1) Increasing the credibility threshold progressively improves model precision while inducing systematic overestimation bias in regional susceptibility assessment; (2) Integrated analysis of model performance and landslide distribution characteristics (where recall, F1-score, and AUC values initially increase then decrease) confirms the optimal effectiveness when selecting negative samples within a credibility threshold range of 0.7–1.0. This study innovatively achieves quantitative optimization of negative samples and provides a universal solution for improving the performance of diverse models reliant on negative sampling strategies.https://www.mdpi.com/2076-3417/15/14/7646landslide susceptibilitynegative sampling methodfrequency ratiorandom forest model
spellingShingle Zhijie Ning
Yongbo Tie
Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
Applied Sciences
landslide susceptibility
negative sampling method
frequency ratio
random forest model
title Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
title_full Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
title_fullStr Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
title_full_unstemmed Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
title_short Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
title_sort sampling method based on fuzzy membership for computing negative sample credibility and its applications
topic landslide susceptibility
negative sampling method
frequency ratio
random forest model
url https://www.mdpi.com/2076-3417/15/14/7646
work_keys_str_mv AT zhijiening samplingmethodbasedonfuzzymembershipforcomputingnegativesamplecredibilityanditsapplications
AT yongbotie samplingmethodbasedonfuzzymembershipforcomputingnegativesamplecredibilityanditsapplications