Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications

Current sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative...

Bibliographic Details
Main Authors:	Zhijie Ning, Yongbo Tie
Format:	Article
Language:	English
Published:	MDPI AG 2025-07-01
Series:	Applied Sciences
Subjects:	landslide susceptibility; negative sampling method; frequency ratio; random forest model
Online Access:	https://www.mdpi.com/2076-3417/15/14/7646

author	Zhijie Ning; Yongbo Tie
collection	DOAJ
description	Current sampling methods provide no effective quantitative mechanism for evaluating the intrinsic credibility of negative samples. This impedes systematic quantification of how misselecting geologically predisposed areas (i.e., potential landslide zones) as negative samples affects the accuracy of landslide susceptibility evaluation models. To overcome this challenge, this study proposes a fuzzy membership-based sampling method for assessing negative sample credibility in the Liangshan Yi Autonomous Prefecture, where credibility is defined as the confidence level that a sample is a stable nonlandslide sample. Negative samples were then drawn at stratified credibility thresholds to construct a coupled frequency ratio–random forest model. The influence of negative sample credibility on model performance was systematically evaluated using several metrics, including the F1-score (a metric of classification performance), the area under the receiver operating characteristic curve (AUC), and the proportion of actual landslides falling in high-susceptibility zones. The results are as follows: (1) increasing the credibility threshold progressively improves model precision but induces a systematic overestimation bias in regional susceptibility assessment; (2) integrated analysis of model performance and landslide distribution characteristics (recall, F1-score, and AUC first increase and then decrease) confirms that effectiveness is optimal when negative samples are selected within a credibility threshold range of 0.7–1.0. This study achieves quantitative optimization of negative samples and provides a broadly applicable solution for improving the performance of models that rely on negative sampling strategies.
format	Article
id	doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c0
institution	Matheson Library
issn	2076-3417
language	English
publishDate	2025-07-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c0; 2025-07-25T13:11:55Z; eng; MDPI AG; Applied Sciences; ISSN 2076-3417; 2025-07-01; vol. 15, no. 14, art. 7646; doi:10.3390/app15147646; Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications; Zhijie Ning (Chinese Academy of Geological Sciences, Beijing 100037, China); Yongbo Tie (Chengdu Center of China Geological Survey, Chengdu 610081, China); https://www.mdpi.com/2076-3417/15/14/7646; landslide susceptibility; negative sampling method; frequency ratio; random forest model
title	Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
topic	landslide susceptibility; negative sampling method; frequency ratio; random forest model
url	https://www.mdpi.com/2076-3417/15/14/7646


Similar Items