Hybrid model for cleaning abnormal data of wind turbine power curve based on machine learning approaches
This paper addresses important challenges in wind energy prediction caused by outliers in wind data, which distort the wind turbine power curve and lead to inaccurate performance assessments and suboptimal operation strategies. The major difficulty here is detecting and eliminating these outliers fr...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-09-01
|
Series: | e-Prime: Advances in Electrical Engineering, Electronics and Energy |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2772671125001500 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | This paper addresses important challenges in wind energy prediction caused by outliers in wind data, which distort the wind turbine power curve and lead to inaccurate performance assessments and suboptimal operation strategies. The major difficulty here is detecting and eliminating these outliers from complex wind datasets, as inaccurate data can significantly impact forecasting and related activities. To overcome this challenge, the paper proposes a hybrid model combining fuzzy C-means clustering, Mahalanobis distance, and Artificial Neural Networks (ANN) to detect and remove outliers far more accurately than any individual method or other traditional hybrid method, decreasing false alarms and misses. It improves data quality and boosts the reliability of turbine performance analysis, resource assessment, and forecasting, supporting more efficient and sustainable wind-power operations. The results show (1) that the proposed hybrid model achieves 15.4 % more accuracy than the other traditional hybrid models in detecting and removing outliers. (2) The proposed hybrid model gives an overall ≈ 116.1 % improvement in outlier-detection accuracy over the individual models. (3) Adding the ANN to the proposed hybrid model boosts the outlier-detection accuracy to about a 69.5 % relative improvement. (4) Detecting and cleaning outliers by the proposed hybrid model cuts the RMSE from 2.38 to 1.27, reducing prediction error by 46.6 %. (5) The advanced hybrid model used in this study for comparison purposes achieves nearly identical accuracy to the proposed hybrid model; it reduces RMSE by ∼0.015 and MAPE by ∼0.04 pp and boosts R² by ∼0.001 while maintaining almost perfect outlier detection (99 % vs. 100 %). Although the advanced model offers a marginal edge in reconstruction quality, the lightweight, scalable proposed hybrid model remains better appropriate for real-world deployment due to its lower computational overhead and more straightforward maintenance. |
---|---|
ISSN: | 2772-6711 |