Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications
Current sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-07-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/14/7646 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1839616611619700736 |
---|---|
author | Zhijie Ning Yongbo Tie |
author_facet | Zhijie Ning Yongbo Tie |
author_sort | Zhijie Ning |
collection | DOAJ |
description | Current sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative samples on the accuracy of landslide susceptibility evaluation models. To overcome this challenge, this study proposes a fuzzy membership-based sampling method for assessing negative sample credibility in the Liangshan Yi Autonomous Prefecture, where credibility is defined as the confidence level of stable nonlandslide samples. Subsequently, negative samples were sampled across stratified credibility thresholds to construct a frequency ratio–random forest coupled model. The influence of negative sample credibility on model performance was then systematically evaluated using various metrics, including the F1-score (metrics for evaluating classification performance), area under the receiver operating characteristic curve (AUC), and actual landslide distribution ratio (landslide proportion) in high-susceptibility zones. The results are as follows: (1) Increasing the credibility threshold progressively improves model precision while inducing systematic overestimation bias in regional susceptibility assessment; (2) Integrated analysis of model performance and landslide distribution characteristics (where recall, F1-score, and AUC values initially increase then decrease) confirms the optimal effectiveness when selecting negative samples within a credibility threshold range of 0.7–1.0. This study innovatively achieves quantitative optimization of negative samples and provides a universal solution for improving the performance of diverse models reliant on negative sampling strategies. |
format | Article |
id | doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c0 |
institution | Matheson Library |
issn | 2076-3417 |
language | English |
publishDate | 2025-07-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj-art-1a7c9e270d0f47f6b2250b3b5b0f02c02025-07-25T13:11:55ZengMDPI AGApplied Sciences2076-34172025-07-011514764610.3390/app15147646Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its ApplicationsZhijie Ning0Yongbo Tie1Chinese Academy of Geological Sciences, Beijing 100037, ChinaChengdu Center of China Geological Survey, Chengdu 610081, ChinaCurrent sampling methods do not provide effective quantitative assessment mechanisms for evaluating the intrinsic credibility of negative samples. This impedes the systematic quantification of the effect of misselection of geologically predisposed areas (i.e., potential landslide zones) as negative samples on the accuracy of landslide susceptibility evaluation models. To overcome this challenge, this study proposes a fuzzy membership-based sampling method for assessing negative sample credibility in the Liangshan Yi Autonomous Prefecture, where credibility is defined as the confidence level of stable nonlandslide samples. Subsequently, negative samples were sampled across stratified credibility thresholds to construct a frequency ratio–random forest coupled model. The influence of negative sample credibility on model performance was then systematically evaluated using various metrics, including the F1-score (metrics for evaluating classification performance), area under the receiver operating characteristic curve (AUC), and actual landslide distribution ratio (landslide proportion) in high-susceptibility zones. The results are as follows: (1) Increasing the credibility threshold progressively improves model precision while inducing systematic overestimation bias in regional susceptibility assessment; (2) Integrated analysis of model performance and landslide distribution characteristics (where recall, F1-score, and AUC values initially increase then decrease) confirms the optimal effectiveness when selecting negative samples within a credibility threshold range of 0.7–1.0. This study innovatively achieves quantitative optimization of negative samples and provides a universal solution for improving the performance of diverse models reliant on negative sampling strategies.https://www.mdpi.com/2076-3417/15/14/7646landslide susceptibilitynegative sampling methodfrequency ratiorandom forest model |
spellingShingle | Zhijie Ning Yongbo Tie Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications Applied Sciences landslide susceptibility negative sampling method frequency ratio random forest model |
title | Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications |
title_full | Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications |
title_fullStr | Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications |
title_full_unstemmed | Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications |
title_short | Sampling Method Based on Fuzzy Membership for Computing Negative Sample Credibility and Its Applications |
title_sort | sampling method based on fuzzy membership for computing negative sample credibility and its applications |
topic | landslide susceptibility negative sampling method frequency ratio random forest model |
url | https://www.mdpi.com/2076-3417/15/14/7646 |
work_keys_str_mv | AT zhijiening samplingmethodbasedonfuzzymembershipforcomputingnegativesamplecredibilityanditsapplications AT yongbotie samplingmethodbasedonfuzzymembershipforcomputingnegativesamplecredibilityanditsapplications |